Dataset statistics
| Number of variables | 35 |
|---|---|
| Number of observations | 569501 |
| Missing cells | 10462746 |
| Missing cells (%) | 52.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 GiB |
| Average record size in memory | 2.0 KiB |
Variable types
| CAT | 29 |
|---|---|
| NUM | 6 |
RECVDATE has a high cardinality: 260 distinct values | High cardinality |
STATE has a high cardinality: 64 distinct values | High cardinality |
RPT_DATE has a high cardinality: 109 distinct values | High cardinality |
SYMPTOM_TEXT has a high cardinality: 544604 distinct values | High cardinality |
DATEDIED has a high cardinality: 297 distinct values | High cardinality |
VAX_DATE has a high cardinality: 1701 distinct values | High cardinality |
ONSET_DATE has a high cardinality: 1126 distinct values | High cardinality |
LAB_DATA has a high cardinality: 125645 distinct values | High cardinality |
OTHER_MEDS has a high cardinality: 218020 distinct values | High cardinality |
CUR_ILL has a high cardinality: 51645 distinct values | High cardinality |
HISTORY has a high cardinality: 146407 distinct values | High cardinality |
PRIOR_VAX has a high cardinality: 23877 distinct values | High cardinality |
SPLTTYPE has a high cardinality: 77868 distinct values | High cardinality |
TODAYS_DATE has a high cardinality: 367 distinct values | High cardinality |
ALLERGIES has a high cardinality: 99925 distinct values | High cardinality |
CAGE_YR is highly correlated with AGE_YRS | High correlation |
AGE_YRS is highly correlated with CAGE_YR | High correlation |
STATE has 69104 (12.1%) missing values | Missing |
AGE_YRS has 58342 (10.2%) missing values | Missing |
CAGE_YR has 110446 (19.4%) missing values | Missing |
CAGE_MO has 567757 (99.7%) missing values | Missing |
RPT_DATE has 569151 (99.9%) missing values | Missing |
DIED has 562319 (98.7%) missing values | Missing |
DATEDIED has 563051 (98.9%) missing values | Missing |
L_THREAT has 560732 (98.5%) missing values | Missing |
ER_VISIT has 569449 (> 99.9%) missing values | Missing |
HOSPITAL has 536233 (94.2%) missing values | Missing |
HOSPDAYS has 546713 (96.0%) missing values | Missing |
X_STAY has 569195 (99.9%) missing values | Missing |
DISABLE has 560782 (98.5%) missing values | Missing |
RECOVD has 49549 (8.7%) missing values | Missing |
VAX_DATE has 39266 (6.9%) missing values | Missing |
ONSET_DATE has 44284 (7.8%) missing values | Missing |
NUMDAYS has 66139 (11.6%) missing values | Missing |
LAB_DATA has 346365 (60.8%) missing values | Missing |
V_FUNDBY has 569108 (99.9%) missing values | Missing |
OTHER_MEDS has 234118 (41.1%) missing values | Missing |
CUR_ILL has 304713 (53.5%) missing values | Missing |
HISTORY has 211582 (37.2%) missing values | Missing |
PRIOR_VAX has 542711 (95.3%) missing values | Missing |
SPLTTYPE has 404977 (71.1%) missing values | Missing |
BIRTH_DEFECT has 569178 (99.9%) missing values | Missing |
OFC_VISIT has 462554 (81.2%) missing values | Missing |
ER_ED_VISIT has 499046 (87.6%) missing values | Missing |
ALLERGIES has 273103 (48.0%) missing values | Missing |
HOSPDAYS is highly skewed (γ1 = 106.7097018) | Skewed |
NUMDAYS is highly skewed (γ1 = 46.71933369) | Skewed |
VAERS_ID has unique values | Unique |
NUMDAYS has 221204 (38.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-09-30 16:55:38.020739 |
|---|---|
| Analysis finished | 2021-09-30 16:56:40.165152 |
| Duration | 1 minute and 2.14 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 569501 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1281987.761 |
|---|---|
| Minimum | 916600 |
| Maximum | 1708053 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 916600 |
|---|---|
| 5-th percentile | 947023 |
| Q1 | 1088223 |
| median | 1258457 |
| Q3 | 1482806 |
| 95-th percentile | 1652383 |
| Maximum | 1708053 |
| Range | 791453 |
| Interquartile range (IQR) | 394583 |
Descriptive statistics
| Standard deviation | 228472.5632 |
|---|---|
| Coefficient of variation (CV) | 0.1782174293 |
| Kurtosis | -1.159440595 |
| Mean | 1281987.761 |
| Median Absolute Deviation (MAD) | 187949 |
| Skewness | 0.1889093472 |
| Sum | 7.30093312e+11 |
| Variance | 5.219971213e+10 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 1052674 | 1 | < 0.1% | |
| 1055574 | 1 | < 0.1% | |
| 1104718 | 1 | < 0.1% | |
| 1102671 | 1 | < 0.1% | |
| 1059664 | 1 | < 0.1% | |
| 1063762 | 1 | < 0.1% | |
| 1051476 | 1 | < 0.1% | |
| 1049429 | 1 | < 0.1% | |
| 1076056 | 1 | < 0.1% | |
| 1162082 | 1 | < 0.1% | |
| 1074009 | 1 | < 0.1% | |
| 1080154 | 1 | < 0.1% | |
| 1078107 | 1 | < 0.1% | |
| 1071966 | 1 | < 0.1% | |
| 1069919 | 1 | < 0.1% | |
| 1157984 | 1 | < 0.1% | |
| 1098573 | 1 | < 0.1% | |
| 1100620 | 1 | < 0.1% | |
| 1106761 | 1 | < 0.1% | |
| 1108808 | 1 | < 0.1% | |
| 1086279 | 1 | < 0.1% | |
| 1088326 | 1 | < 0.1% | |
| 1082181 | 1 | < 0.1% | |
| 1084228 | 1 | < 0.1% | |
| 1094467 | 1 | < 0.1% | |
| Other values (569476) | 569476 | > 99.9% |
| Value | Count | Frequency (%) | |
| 916600 | 1 | < 0.1% | |
| 916601 | 1 | < 0.1% | |
| 916602 | 1 | < 0.1% | |
| 916603 | 1 | < 0.1% | |
| 916604 | 1 | < 0.1% | |
| 916605 | 1 | < 0.1% | |
| 916606 | 1 | < 0.1% | |
| 916607 | 1 | < 0.1% | |
| 916608 | 1 | < 0.1% | |
| 916609 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1708053 | 1 | < 0.1% | |
| 1708052 | 1 | < 0.1% | |
| 1708050 | 1 | < 0.1% | |
| 1708049 | 1 | < 0.1% | |
| 1708048 | 1 | < 0.1% | |
| 1708047 | 1 | < 0.1% | |
| 1708046 | 1 | < 0.1% | |
| 1708045 | 1 | < 0.1% | |
| 1708044 | 1 | < 0.1% | |
| 1708043 | 1 | < 0.1% |
| Distinct | 260 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.3 MiB |
| 08/22/2021 | 14600 |
|---|---|
| 08/15/2021 | 12189 |
| 08/21/2021 | 10509 |
| 08/28/2021 | 9970 |
| 04/13/2021 | 5742 |
| Other values (255) |
| Value | Count | Frequency (%) | |
| 08/22/2021 | 14600 | 2.6% | |
| 08/15/2021 | 12189 | 2.1% | |
| 08/21/2021 | 10509 | 1.8% | |
| 08/28/2021 | 9970 | 1.8% | |
| 04/13/2021 | 5742 | 1.0% | |
| 08/23/2021 | 4692 | 0.8% | |
| 04/16/2021 | 4574 | 0.8% | |
| 04/14/2021 | 4548 | 0.8% | |
| 08/13/2021 | 4176 | 0.7% | |
| 04/09/2021 | 4139 | 0.7% | |
| 04/07/2021 | 4136 | 0.7% | |
| 04/15/2021 | 4019 | 0.7% | |
| 04/22/2021 | 3991 | 0.7% | |
| 04/08/2021 | 3940 | 0.7% | |
| 04/12/2021 | 3913 | 0.7% | |
| 04/23/2021 | 3807 | 0.7% | |
| 03/31/2021 | 3732 | 0.7% | |
| 04/24/2021 | 3717 | 0.7% | |
| 04/21/2021 | 3715 | 0.7% | |
| 01/08/2021 | 3715 | 0.7% | |
| 04/27/2021 | 3698 | 0.6% | |
| 08/11/2021 | 3682 | 0.6% | |
| 04/01/2021 | 3601 | 0.6% | |
| 01/28/2021 | 3585 | 0.6% | |
| 04/10/2021 | 3579 | 0.6% | |
| Other values (235) | 437532 | 76.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 1454718 | 25.5% | |
| 0 | 1338789 | 23.5% | |
| / | 1139002 | 20.0% | |
| 1 | 910600 | 16.0% | |
| 8 | 165571 | 2.9% | |
| 4 | 156198 | 2.7% | |
| 3 | 149929 | 2.6% | |
| 5 | 126369 | 2.2% | |
| 6 | 94548 | 1.7% | |
| 7 | 85496 | 1.5% | |
| 9 | 73790 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4556008 | 80.0% | |
| Other Punctuation | 1139002 | 20.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 1454718 | 31.9% | |
| 0 | 1338789 | 29.4% | |
| 1 | 910600 | 20.0% | |
| 8 | 165571 | 3.6% | |
| 4 | 156198 | 3.4% | |
| 3 | 149929 | 3.3% | |
| 5 | 126369 | 2.8% | |
| 6 | 94548 | 2.1% | |
| 7 | 85496 | 1.9% | |
| 9 | 73790 | 1.6% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1139002 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5695010 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 1454718 | 25.5% | |
| 0 | 1338789 | 23.5% | |
| / | 1139002 | 20.0% | |
| 1 | 910600 | 16.0% | |
| 8 | 165571 | 2.9% | |
| 4 | 156198 | 2.7% | |
| 3 | 149929 | 2.6% | |
| 5 | 126369 | 2.2% | |
| 6 | 94548 | 1.7% | |
| 7 | 85496 | 1.5% | |
| 9 | 73790 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5695010 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 1454718 | 25.5% | |
| 0 | 1338789 | 23.5% | |
| / | 1139002 | 20.0% | |
| 1 | 910600 | 16.0% | |
| 8 | 165571 | 2.9% | |
| 4 | 156198 | 2.7% | |
| 3 | 149929 | 2.6% | |
| 5 | 126369 | 2.2% | |
| 6 | 94548 | 1.7% | |
| 7 | 85496 | 1.5% | |
| 9 | 73790 | 1.3% |
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 69104 |
| Missing (%) | 12.1% |
| Memory size | 4.3 MiB |
| CA | |
|---|---|
| FL | |
| TX | 31950 |
| NY | 30836 |
| IN | 21752 |
| Other values (59) |
| Value | Count | Frequency (%) | |
| CA | 55566 | 9.8% | |
| FL | 33734 | 5.9% | |
| TX | 31950 | 5.6% | |
| NY | 30836 | 5.4% | |
| IN | 21752 | 3.8% | |
| PA | 20970 | 3.7% | |
| IL | 18278 | 3.2% | |
| OH | 17101 | 3.0% | |
| MI | 16717 | 2.9% | |
| NJ | 16014 | 2.8% | |
| NC | 14609 | 2.6% | |
| WA | 13394 | 2.4% | |
| VA | 13346 | 2.3% | |
| MA | 13191 | 2.3% | |
| GA | 12785 | 2.2% | |
| AZ | 12333 | 2.2% | |
| MD | 11278 | 2.0% | |
| MN | 10963 | 1.9% | |
| CO | 10530 | 1.8% | |
| WI | 9991 | 1.8% | |
| MO | 8584 | 1.5% | |
| TN | 8239 | 1.4% | |
| OR | 7713 | 1.4% | |
| CT | 7409 | 1.3% | |
| KY | 6685 | 1.2% | |
| Other values (39) | 76429 | 13.4% | |
| (Missing) | 69104 | 12.1% |
Frequencies of value counts
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.121341315 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| A | 160076 | 13.3% | |
| n | 138208 | 11.4% | |
| N | 116310 | 9.6% | |
| C | 95353 | 7.9% | |
| I | 77333 | 6.4% | |
| M | 71701 | 5.9% | |
| a | 69105 | 5.7% | |
| L | 61388 | 5.1% | |
| T | 55104 | 4.6% | |
| O | 49611 | 4.1% | |
| Y | 38299 | 3.2% | |
| F | 33737 | 2.8% | |
| X | 31958 | 2.6% | |
| W | 26287 | 2.2% | |
| P | 23089 | 1.9% | |
| H | 22042 | 1.8% | |
| V | 20921 | 1.7% | |
| D | 19018 | 1.6% | |
| K | 18066 | 1.5% | |
| J | 16014 | 1.3% | |
| R | 14981 | 1.2% | |
| S | 13392 | 1.1% | |
| G | 12863 | 1.1% | |
| Z | 12333 | 1.0% | |
| E | 6992 | 0.6% | |
| Other values (4) | 3925 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 1000792 | 82.8% | |
| Lowercase Letter | 207314 | 17.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 160076 | 16.0% | |
| N | 116310 | 11.6% | |
| C | 95353 | 9.5% | |
| I | 77333 | 7.7% | |
| M | 71701 | 7.2% | |
| L | 61388 | 6.1% | |
| T | 55104 | 5.5% | |
| O | 49611 | 5.0% | |
| Y | 38299 | 3.8% | |
| F | 33737 | 3.4% | |
| X | 31958 | 3.2% | |
| W | 26287 | 2.6% | |
| P | 23089 | 2.3% | |
| H | 22042 | 2.2% | |
| V | 20921 | 2.1% | |
| D | 19018 | 1.9% | |
| K | 18066 | 1.8% | |
| J | 16014 | 1.6% | |
| R | 14981 | 1.5% | |
| S | 13392 | 1.3% | |
| G | 12863 | 1.3% | |
| Z | 12333 | 1.2% | |
| E | 6992 | 0.7% | |
| U | 3917 | 0.4% | |
| B | 5 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 138208 | 66.7% | |
| a | 69105 | 33.3% | |
| x | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1208106 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| A | 160076 | 13.3% | |
| n | 138208 | 11.4% | |
| N | 116310 | 9.6% | |
| C | 95353 | 7.9% | |
| I | 77333 | 6.4% | |
| M | 71701 | 5.9% | |
| a | 69105 | 5.7% | |
| L | 61388 | 5.1% | |
| T | 55104 | 4.6% | |
| O | 49611 | 4.1% | |
| Y | 38299 | 3.2% | |
| F | 33737 | 2.8% | |
| X | 31958 | 2.6% | |
| W | 26287 | 2.2% | |
| P | 23089 | 1.9% | |
| H | 22042 | 1.8% | |
| V | 20921 | 1.7% | |
| D | 19018 | 1.6% | |
| K | 18066 | 1.5% | |
| J | 16014 | 1.3% | |
| R | 14981 | 1.2% | |
| S | 13392 | 1.1% | |
| G | 12863 | 1.1% | |
| Z | 12333 | 1.0% | |
| E | 6992 | 0.6% | |
| Other values (4) | 3925 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1208106 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| A | 160076 | 13.3% | |
| n | 138208 | 11.4% | |
| N | 116310 | 9.6% | |
| C | 95353 | 7.9% | |
| I | 77333 | 6.4% | |
| M | 71701 | 5.9% | |
| a | 69105 | 5.7% | |
| L | 61388 | 5.1% | |
| T | 55104 | 4.6% | |
| O | 49611 | 4.1% | |
| Y | 38299 | 3.2% | |
| F | 33737 | 2.8% | |
| X | 31958 | 2.6% | |
| W | 26287 | 2.2% | |
| P | 23089 | 1.9% | |
| H | 22042 | 1.8% | |
| V | 20921 | 1.7% | |
| D | 19018 | 1.6% | |
| K | 18066 | 1.5% | |
| J | 16014 | 1.3% | |
| R | 14981 | 1.2% | |
| S | 13392 | 1.1% | |
| G | 12863 | 1.1% | |
| Z | 12333 | 1.0% | |
| E | 6992 | 0.6% | |
| Other values (4) | 3925 | 0.3% |
| Distinct | 144 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 58342 |
| Missing (%) | 10.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.3766859 |
|---|---|
| Minimum | 0 |
| Maximum | 119 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 35 |
| median | 50 |
| Q3 | 64 |
| 95-th percentile | 79 |
| Maximum | 119 |
| Range | 119 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 18.82628916 |
|---|---|
| Coefficient of variation (CV) | 0.3812789137 |
| Kurtosis | -0.7547235061 |
| Mean | 49.3766859 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.01087300755 |
| Sum | 25239337.39 |
| Variance | 354.4291637 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 50 | 9716 | 1.7% | |
| 65 | 9509 | 1.7% | |
| 60 | 9203 | 1.6% | |
| 51 | 9169 | 1.6% | |
| 58 | 9144 | 1.6% | |
| 59 | 9076 | 1.6% | |
| 56 | 8962 | 1.6% | |
| 57 | 8911 | 1.6% | |
| 40 | 8900 | 1.6% | |
| 66 | 8893 | 1.6% | |
| 38 | 8892 | 1.6% | |
| 39 | 8863 | 1.6% | |
| 61 | 8797 | 1.5% | |
| 37 | 8779 | 1.5% | |
| 49 | 8776 | 1.5% | |
| 41 | 8763 | 1.5% | |
| 36 | 8689 | 1.5% | |
| 62 | 8608 | 1.5% | |
| 52 | 8608 | 1.5% | |
| 55 | 8591 | 1.5% | |
| 63 | 8573 | 1.5% | |
| 42 | 8526 | 1.5% | |
| 67 | 8503 | 1.5% | |
| 43 | 8493 | 1.5% | |
| 35 | 8488 | 1.5% | |
| Other values (119) | 289727 | 50.9% | |
| (Missing) | 58342 | 10.2% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 0.08 | 45 | < 0.1% | |
| 0.17 | 129 | < 0.1% | |
| 0.25 | 26 | < 0.1% | |
| 0.33 | 89 | < 0.1% | |
| 0.42 | 23 | < 0.1% | |
| 0.5 | 66 | < 0.1% | |
| 0.58 | 40 | < 0.1% | |
| 0.67 | 13 | < 0.1% | |
| 0.75 | 18 | < 0.1% |
| Value | Count | Frequency (%) | |
| 119 | 3 | < 0.1% | |
| 115 | 5 | < 0.1% | |
| 113 | 1 | < 0.1% | |
| 109 | 1 | < 0.1% | |
| 106 | 2 | < 0.1% | |
| 105 | 6 | < 0.1% | |
| 104 | 4 | < 0.1% | |
| 103 | 18 | < 0.1% | |
| 102 | 24 | < 0.1% | |
| 101 | 49 | < 0.1% |
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 110446 |
| Missing (%) | 19.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.04191437 |
|---|---|
| Minimum | 0 |
| Maximum | 120 |
| Zeros | 1344 |
| Zeros (%) | 0.2% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 34 |
| median | 49 |
| Q3 | 64 |
| 95-th percentile | 79 |
| Maximum | 120 |
| Range | 120 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 18.97204892 |
|---|---|
| Coefficient of variation (CV) | 0.3868537589 |
| Kurtosis | -0.7096830573 |
| Mean | 49.04191437 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.006852441575 |
| Sum | 22512936 |
| Variance | 359.9386402 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 50 | 8631 | 1.5% | |
| 65 | 8415 | 1.5% | |
| 51 | 8151 | 1.4% | |
| 60 | 8142 | 1.4% | |
| 38 | 8108 | 1.4% | |
| 58 | 8097 | 1.4% | |
| 59 | 8002 | 1.4% | |
| 57 | 7999 | 1.4% | |
| 40 | 7964 | 1.4% | |
| 56 | 7963 | 1.4% | |
| 39 | 7956 | 1.4% | |
| 37 | 7951 | 1.4% | |
| 66 | 7941 | 1.4% | |
| 41 | 7939 | 1.4% | |
| 36 | 7877 | 1.4% | |
| 61 | 7833 | 1.4% | |
| 49 | 7824 | 1.4% | |
| 35 | 7685 | 1.3% | |
| 55 | 7654 | 1.3% | |
| 52 | 7638 | 1.3% | |
| 42 | 7629 | 1.3% | |
| 62 | 7622 | 1.3% | |
| 34 | 7619 | 1.3% | |
| 63 | 7617 | 1.3% | |
| 43 | 7571 | 1.3% | |
| Other values (90) | 261227 | 45.9% | |
| (Missing) | 110446 | 19.4% |
| Value | Count | Frequency (%) | |
| 0 | 1344 | 0.2% | |
| 1 | 347 | 0.1% | |
| 2 | 53 | < 0.1% | |
| 3 | 38 | < 0.1% | |
| 4 | 130 | < 0.1% | |
| 5 | 54 | < 0.1% | |
| 6 | 22 | < 0.1% | |
| 7 | 38 | < 0.1% | |
| 8 | 30 | < 0.1% | |
| 9 | 38 | < 0.1% |
| Value | Count | Frequency (%) | |
| 120 | 41 | < 0.1% | |
| 119 | 7 | < 0.1% | |
| 118 | 4 | < 0.1% | |
| 117 | 1 | < 0.1% | |
| 113 | 1 | < 0.1% | |
| 112 | 2 | < 0.1% | |
| 109 | 1 | < 0.1% | |
| 108 | 1 | < 0.1% | |
| 106 | 1 | < 0.1% | |
| 105 | 7 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 567757 |
| Missing (%) | 99.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1646788991 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 957 |
| Zeros (%) | 0.2% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.3 |
| 95-th percentile | 0.7 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.3 |
Descriptive statistics
| Standard deviation | 0.2375339698 |
|---|---|
| Coefficient of variation (CV) | 1.442406836 |
| Kurtosis | 1.644515416 |
| Mean | 0.1646788991 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.51404745 |
| Sum | 287.2 |
| Variance | 0.05642238679 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) | |
| 0 | 957 | 0.2% | |
| 0.2 | 179 | < 0.1% | |
| 0.3 | 151 | < 0.1% | |
| 0.1 | 128 | < 0.1% | |
| 0.5 | 106 | < 0.1% | |
| 0.4 | 74 | < 0.1% | |
| 0.6 | 57 | < 0.1% | |
| 0.7 | 28 | < 0.1% | |
| 0.8 | 26 | < 0.1% | |
| 1 | 19 | < 0.1% | |
| 0.9 | 19 | < 0.1% | |
| (Missing) | 567757 | 99.7% |
| Value | Count | Frequency (%) | |
| 0 | 957 | 0.2% | |
| 0.1 | 128 | < 0.1% | |
| 0.2 | 179 | < 0.1% | |
| 0.3 | 151 | < 0.1% | |
| 0.4 | 74 | < 0.1% | |
| 0.5 | 106 | < 0.1% | |
| 0.6 | 57 | < 0.1% | |
| 0.7 | 28 | < 0.1% | |
| 0.8 | 26 | < 0.1% | |
| 0.9 | 19 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 19 | < 0.1% | |
| 0.9 | 19 | < 0.1% | |
| 0.8 | 26 | < 0.1% | |
| 0.7 | 28 | < 0.1% | |
| 0.6 | 57 | < 0.1% | |
| 0.5 | 106 | < 0.1% | |
| 0.4 | 74 | < 0.1% | |
| 0.3 | 151 | < 0.1% | |
| 0.2 | 179 | < 0.1% | |
| 0.1 | 128 | < 0.1% |
SEX
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.3 MiB |
| F | |
|---|---|
| M | |
| U | 21014 |
| Value | Count | Frequency (%) | |
| F | 388350 | 68.2% | |
| M | 160137 | 28.1% | |
| U | 21014 | 3.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| F | 388350 | 68.2% | |
| M | 160137 | 28.1% | |
| U | 21014 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 569501 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| F | 388350 | 68.2% | |
| M | 160137 | 28.1% | |
| U | 21014 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 569501 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| F | 388350 | 68.2% | |
| M | 160137 | 28.1% | |
| U | 21014 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 569501 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| F | 388350 | 68.2% | |
| M | 160137 | 28.1% | |
| U | 21014 | 3.7% |
| Distinct | 109 |
|---|---|
| Distinct (%) | 31.1% |
| Missing | 569151 |
| Missing (%) | 99.9% |
| Memory size | 4.3 MiB |
| 01/08/2021 | 22 |
|---|---|
| 01/11/2021 | 19 |
| 01/13/2021 | 14 |
| 01/12/2021 | 14 |
| 02/04/2021 | 13 |
| Other values (104) |
| Value | Count | Frequency (%) | |
| 01/08/2021 | 22 | < 0.1% | |
| 01/11/2021 | 19 | < 0.1% | |
| 01/13/2021 | 14 | < 0.1% | |
| 01/12/2021 | 14 | < 0.1% | |
| 02/04/2021 | 13 | < 0.1% | |
| 01/06/2021 | 13 | < 0.1% | |
| 01/05/2021 | 9 | < 0.1% | |
| 02/08/2021 | 9 | < 0.1% | |
| 01/04/2021 | 9 | < 0.1% | |
| 12/31/2020 | 8 | < 0.1% | |
| 01/27/2021 | 8 | < 0.1% | |
| 01/07/2021 | 8 | < 0.1% | |
| 05/22/2021 | 7 | < 0.1% | |
| 01/20/2021 | 7 | < 0.1% | |
| 01/28/2021 | 7 | < 0.1% | |
| 02/10/2021 | 6 | < 0.1% | |
| 01/21/2021 | 6 | < 0.1% | |
| 12/29/2020 | 5 | < 0.1% | |
| 01/29/2021 | 5 | < 0.1% | |
| 02/01/2021 | 5 | < 0.1% | |
| 02/11/2021 | 5 | < 0.1% | |
| 12/18/2020 | 5 | < 0.1% | |
| 01/22/2021 | 4 | < 0.1% | |
| 02/18/2021 | 4 | < 0.1% | |
| 02/12/2021 | 4 | < 0.1% | |
| Other values (84) | 134 | < 0.1% | |
| (Missing) | 569151 | 99.9% |
Frequencies of value counts
Unique
| Unique | 52 ? |
|---|---|
| Unique (%) | 14.9% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 3 |
| Mean length | 3.004302012 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1138302 | 66.5% | |
| a | 569151 | 33.3% | |
| 2 | 926 | 0.1% | |
| 0 | 858 | 0.1% | |
| 1 | 705 | < 0.1% | |
| / | 700 | < 0.1% | |
| 3 | 63 | < 0.1% | |
| 8 | 61 | < 0.1% | |
| 5 | 51 | < 0.1% | |
| 4 | 42 | < 0.1% | |
| 6 | 37 | < 0.1% | |
| 9 | 31 | < 0.1% | |
| 7 | 26 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1707453 | 99.8% | |
| Decimal Number | 2800 | 0.2% | |
| Other Punctuation | 700 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1138302 | 66.7% | |
| a | 569151 | 33.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 926 | 33.1% | |
| 0 | 858 | 30.6% | |
| 1 | 705 | 25.2% | |
| 3 | 63 | 2.2% | |
| 8 | 61 | 2.2% | |
| 5 | 51 | 1.8% | |
| 4 | 42 | 1.5% | |
| 6 | 37 | 1.3% | |
| 9 | 31 | 1.1% | |
| 7 | 26 | 0.9% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 700 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1707453 | 99.8% | |
| Common | 3500 | 0.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1138302 | 66.7% | |
| a | 569151 | 33.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 926 | 26.5% | |
| 0 | 858 | 24.5% | |
| 1 | 705 | 20.1% | |
| / | 700 | 20.0% | |
| 3 | 63 | 1.8% | |
| 8 | 61 | 1.7% | |
| 5 | 51 | 1.5% | |
| 4 | 42 | 1.2% | |
| 6 | 37 | 1.1% | |
| 9 | 31 | 0.9% | |
| 7 | 26 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1710953 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1138302 | 66.5% | |
| a | 569151 | 33.3% | |
| 2 | 926 | 0.1% | |
| 0 | 858 | 0.1% | |
| 1 | 705 | < 0.1% | |
| / | 700 | < 0.1% | |
| 3 | 63 | < 0.1% | |
| 8 | 61 | < 0.1% | |
| 5 | 51 | < 0.1% | |
| 4 | 42 | < 0.1% | |
| 6 | 37 | < 0.1% | |
| 9 | 31 | < 0.1% | |
| 7 | 26 | < 0.1% |
| Distinct | 544604 |
|---|---|
| Distinct (%) | 95.6% |
| Missing | 129 |
| Missing (%) | < 0.1% |
| Memory size | 4.3 MiB |
| Error: Improper Storage (temperature) | 837 |
|---|---|
| None stated. | 773 |
| Administered vials that were exposed to room temperature for more than 12 hours; A spontaneous report was received from an employee and a physician concerning a patient, who received Moderna's COVID-19 vaccine (mRNA-1273) and was administered with product that was exposed to room temperature for more than twelve hours. The patient's medical history was not provided. No relevant concomitant medications were reported. On 04 Jan 2021, a freezer containing a vial of mRNA-1273 failed. At 1:11 A.M. the vial experienced a temperature excursion, exceeding 8 degrees Celsius. Over time the dose thawed and reached room temperature. On 04 Jan 2021, the patient received their first of two planned doses of mRNA-1273 intramuscularly for prophylaxis of COVID-19 infection and was administered with product that was exposed to room temperature for more than twelve hours. No treatment information was provided. Action taken with mRNA-1273 in response to the event was not reported. The event, administered with product that was exposed to room temperature for more than twelve hours, was resolved on 04 Jan 2021.; Reporter's Comments: This case concerns a patient of unknown gender and age who received their first of two planned doses of mRNA-1273 (Lot unknown), reporting Product that was exposed to room temperature for more than twelve hours without any associated adverse events. | 769 |
| Pfizer vaccine administered after being stored at regular freezer temps longer than recommended. Dose determined invalid, client contacted and recommended repeat dose. | 541 |
| Vaccinated with vial that might have had a temperature excursion; Vaccinated with vial that might have had a temperature excursion; Vaccinated with vial that might have had a temperature excursion; A spontaneous report was received from a nurse concerning a patient who received Moderna's COVID-19 vaccine (mRNA-1273) and was vaccinated with vial that might have had a temperature excursion. The patient's medical history was not provided. No relevant concomitant medications were reported. On 22 Dec 2020, the nurse reported a shipment was received. The vial arrived frozen and was placed in a freezer at recommended temperature. On 02 Jan 2021, the freezer had failed, and the temperature alarm system did not alert anyone. It was noted that the freezer temperature at 5:50 AM was -5 degrees Celsius (C), at 7:50 AM the freezer was a 1.5 C. It remained between from 9.7 C at 12:51 PM then went to 8.3 C at 1:51 PM then down to -8.7 C at 2:51 PM. On 03 Jan 2021 8:45 PM, the freezer returned to its normal temperature of -20.9 C, -1 C then went to 5.5 C at 11:56 PM. On 04 Jan 2021, the temperature climbed to 19.4 C. On the same day, the patient received their first of two planned doses of mRNA-1273 (Lot number: 025J20-2A, 025L20A, or 027L20A) intramuscularly for prophylaxis of COVID-19 infection and experienced vaccination with vial that might have had a temperature excursion No treatment information was provided. Action taken with mRNA-1273 in response to the event was not reported. The outcome of the event, vaccinated with vial that might have had a temperature excursion, was considered resolved on 04 Jan 2021.; Reporter's Comments: This report refers to a case of out of specification product use, product temperature excursion issue, and product storage error for mRNA-1273. There were no reported AEs associated with this case. | 466 |
| Other values (544599) |
| Value | Count | Frequency (%) | |
| Error: Improper Storage (temperature) | 837 | 0.1% | |
| None stated. | 773 | 0.1% | |
| Administered vials that were exposed to room temperature for more than 12 hours; A spontaneous report was received from an employee and a physician concerning a patient, who received Moderna's COVID-19 vaccine (mRNA-1273) and was administered with product that was exposed to room temperature for more than twelve hours. The patient's medical history was not provided. No relevant concomitant medications were reported. On 04 Jan 2021, a freezer containing a vial of mRNA-1273 failed. At 1:11 A.M. the vial experienced a temperature excursion, exceeding 8 degrees Celsius. Over time the dose thawed and reached room temperature. On 04 Jan 2021, the patient received their first of two planned doses of mRNA-1273 intramuscularly for prophylaxis of COVID-19 infection and was administered with product that was exposed to room temperature for more than twelve hours. No treatment information was provided. Action taken with mRNA-1273 in response to the event was not reported. The event, administered with product that was exposed to room temperature for more than twelve hours, was resolved on 04 Jan 2021.; Reporter's Comments: This case concerns a patient of unknown gender and age who received their first of two planned doses of mRNA-1273 (Lot unknown), reporting Product that was exposed to room temperature for more than twelve hours without any associated adverse events. | 769 | 0.1% | |
| Pfizer vaccine administered after being stored at regular freezer temps longer than recommended. Dose determined invalid, client contacted and recommended repeat dose. | 541 | 0.1% | |
| Vaccinated with vial that might have had a temperature excursion; Vaccinated with vial that might have had a temperature excursion; Vaccinated with vial that might have had a temperature excursion; A spontaneous report was received from a nurse concerning a patient who received Moderna's COVID-19 vaccine (mRNA-1273) and was vaccinated with vial that might have had a temperature excursion. The patient's medical history was not provided. No relevant concomitant medications were reported. On 22 Dec 2020, the nurse reported a shipment was received. The vial arrived frozen and was placed in a freezer at recommended temperature. On 02 Jan 2021, the freezer had failed, and the temperature alarm system did not alert anyone. It was noted that the freezer temperature at 5:50 AM was -5 degrees Celsius (C), at 7:50 AM the freezer was a 1.5 C. It remained between from 9.7 C at 12:51 PM then went to 8.3 C at 1:51 PM then down to -8.7 C at 2:51 PM. On 03 Jan 2021 8:45 PM, the freezer returned to its normal temperature of -20.9 C, -1 C then went to 5.5 C at 11:56 PM. On 04 Jan 2021, the temperature climbed to 19.4 C. On the same day, the patient received their first of two planned doses of mRNA-1273 (Lot number: 025J20-2A, 025L20A, or 027L20A) intramuscularly for prophylaxis of COVID-19 infection and experienced vaccination with vial that might have had a temperature excursion No treatment information was provided. Action taken with mRNA-1273 in response to the event was not reported. The outcome of the event, vaccinated with vial that might have had a temperature excursion, was considered resolved on 04 Jan 2021.; Reporter's Comments: This report refers to a case of out of specification product use, product temperature excursion issue, and product storage error for mRNA-1273. There were no reported AEs associated with this case. | 466 | 0.1% | |
| vaccine expired in fridge | 379 | 0.1% | |
| temperature excursion with vaccine administered | 378 | 0.1% | |
| Error: Patient Too Young for Vaccine Administered- | 331 | 0.1% | |
| Error: Wrong Dose of Vaccine - Too Low | 328 | 0.1% | |
| Error: Improper Storage (temperature)- | 314 | 0.1% | |
| Patient had an ED visit and/or hospitalization within 6 weeks of receiving COVID vaccine. | 306 | 0.1% | |
| Error: Wrong Dose of Vaccine - Too High | 304 | 0.1% | |
| Error: Incorrect Reconstitution | 247 | < 0.1% | |
| Tinnitus | 223 | < 0.1% | |
| Shingles | 214 | < 0.1% | |
| Hospitalization within 6 weeks after receiving vaccine | 208 | < 0.1% | |
| CDC recommends using single-use vials of normal saline to dilute to each vial of the Pfizer vaccine. Staff member incorrectly used 100 ml bottles of normal saline to reconstitute multiple vials of vaccine each day. Staff member used a new bottle per day, however, and used the correct amount of normal saline to dilute each vial (1.8 ml per vial). We have since corrected this error and also notified local health authorities. Patient has not reported any adverse events to health clinic where error occurred. | 203 | < 0.1% | |
| unknown | 179 | < 0.1% | |
| None | 177 | < 0.1% | |
| Pfizer Vaccine administered after being stored at regular freezer temps longer than recommended. Dose determined invalid, client contacted, and recommended dose repeated. | 168 | < 0.1% | |
| Error: Wrong Vaccine Formulation (ex. different manufact. initial and booster)- | 151 | < 0.1% | |
| NONE | 133 | < 0.1% | |
| Error: Booster Given Too Early | 133 | < 0.1% | |
| Death | 129 | < 0.1% | |
| Vaccinated with vial that might have had a temperature excursion; Vaccinated with vial that might have had a temperature excursion; Vaccinated with vial that might have had a temperature excursion; A spontaneous report was received from a nurse concerning a patient who received Moderna's COVID-19 vaccine (mRNA-1273) and was vaccinated with vial that might have had a temperature excursion. The patient's medical history was not provided. No relevant concomitant medications were reported. On 22 Dec 2020, the nurse reported a shipment was received. The vial arrived frozen and was placed in a freezer at recommended temperature. On 02 Jan 2021, the freezer had failed, and the temperature alarm system did not alert anyone. It was noted that the freezer temperature at 5:50 AM was -5 degrees Celsius (C), at 7:50 AM the freezer was a 1.5 C. It remained between from 9.7 C at 12:51 PM then went to 8.3 C at 1:51 PM then down to -8.7 C at 2:51 PM. On 03 Jan 2021 8:45 PM, the freezer returned to its normal temperature of -20.9 C, -1 C then went to 5.5 C at 11:56 PM. On 04 Jan 2021, the temperature climbed to 19.4 C. On the same day, the patient received their first of two planned doses of mRNA-1273 (Lot number: 025J20-2A, 025L20A, or 027L20A) intramuscularly for prophylaxis of COVID-19 infection and experienced vaccination with vial that might have had a temperature excursion No treatment information was provided. Action taken with mRNA-1273 in response to the event was not reported. The outcome of the event, vaccinated with vial that might have had a temperature excursion, was considered resolved on 04 Jan 2021.; Reporter's Comments: This report refers to a case of out of specification product use, product temperature excursion issue, and product storage error for mRNA-1273. There were no reported AEs associated with this case. | 127 | < 0.1% | |
| Other values (544579) | 561354 | 98.6% | |
| (Missing) | 129 | < 0.1% |
Frequencies of value counts
Unique
| Unique | 540408 ? |
|---|---|
| Unique (%) | 94.9% |
Histogram of lengths of the category
Length
| Max length | 29450 |
|---|---|
| Median length | 306 |
| Mean length | 627.9968007 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 57726515 | 16.1% | ||
| e | 32380982 | 9.1% | |
| t | 22374353 | 6.3% | |
| a | 21242358 | 5.9% | |
| n | 20785401 | 5.8% | |
| o | 18633914 | 5.2% | |
| i | 18472811 | 5.2% | |
| r | 15175120 | 4.2% | |
| s | 14808636 | 4.1% | |
| d | 12110904 | 3.4% | |
| h | 10009844 | 2.8% | |
| c | 9510749 | 2.7% | |
| l | 8119803 | 2.3% | |
| m | 6072443 | 1.7% | |
| u | 5861810 | 1.6% | |
| p | 5798801 | 1.6% | |
| f | 5337775 | 1.5% | |
| w | 4054996 | 1.1% | |
| g | 3779849 | 1.1% | |
| . | 3614769 | 1.0% | |
| v | 3497200 | 1.0% | |
| y | 3385480 | 0.9% | |
| I | 3025121 | 0.8% | |
| , | 2932321 | 0.8% | |
| b | 2821334 | 0.8% | |
| Other values (74) | 46111517 | 12.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 247465376 | 69.2% | |
| Space Separator | 57726515 | 16.1% | |
| Uppercase Letter | 27542959 | 7.7% | |
| Decimal Number | 10640764 | 3.0% | |
| Other Punctuation | 8804895 | 2.5% | |
| Dash Punctuation | 2147557 | 0.6% | |
| Open Punctuation | 1643588 | 0.5% | |
| Close Punctuation | 1643277 | 0.5% | |
| Math Symbol | 21407 | < 0.1% | |
| Other Symbol | 7156 | < 0.1% | |
| Connector Punctuation | 1006 | < 0.1% | |
| Modifier Symbol | 162 | < 0.1% | |
| Currency Symbol | 135 | < 0.1% | |
| Control | 9 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| I | 3025121 | 11.0% | |
| A | 2441430 | 8.9% | |
| T | 2192809 | 8.0% | |
| N | 2021837 | 7.3% | |
| O | 1956523 | 7.1% | |
| E | 1866744 | 6.8% | |
| C | 1787175 | 6.5% | |
| R | 1491414 | 5.4% | |
| S | 1445145 | 5.2% | |
| D | 1271891 | 4.6% | |
| V | 1166104 | 4.2% | |
| M | 1141707 | 4.1% | |
| P | 1093696 | 4.0% | |
| L | 848826 | 3.1% | |
| H | 816040 | 3.0% | |
| F | 581419 | 2.1% | |
| B | 548272 | 2.0% | |
| U | 496066 | 1.8% | |
| G | 306192 | 1.1% | |
| Y | 290037 | 1.1% | |
| W | 214928 | 0.8% | |
| J | 189437 | 0.7% | |
| Z | 136471 | 0.5% | |
| X | 114148 | 0.4% | |
| K | 87385 | 0.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 32380982 | 13.1% | |
| t | 22374353 | 9.0% | |
| a | 21242358 | 8.6% | |
| n | 20785401 | 8.4% | |
| o | 18633914 | 7.5% | |
| i | 18472811 | 7.5% | |
| r | 15175120 | 6.1% | |
| s | 14808636 | 6.0% | |
| d | 12110904 | 4.9% | |
| h | 10009844 | 4.0% | |
| c | 9510749 | 3.8% | |
| l | 8119803 | 3.3% | |
| m | 6072443 | 2.5% | |
| u | 5861810 | 2.4% | |
| p | 5798801 | 2.3% | |
| f | 5337775 | 2.2% | |
| w | 4054996 | 1.6% | |
| g | 3779849 | 1.5% | |
| v | 3497200 | 1.4% | |
| y | 3385480 | 1.4% | |
| b | 2821334 | 1.1% | |
| k | 1623773 | 0.7% | |
| x | 694573 | 0.3% | |
| j | 403128 | 0.2% | |
| z | 399459 | 0.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 57726515 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 2755202 | 25.9% | |
| 2 | 2635929 | 24.8% | |
| 0 | 1668702 | 15.7% | |
| 9 | 876026 | 8.2% | |
| 3 | 711759 | 6.7% | |
| 7 | 506925 | 4.8% | |
| 6 | 413129 | 3.9% | |
| 5 | 403076 | 3.8% | |
| 4 | 382337 | 3.6% | |
| 8 | 287679 | 2.7% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 3614769 | 41.1% | |
| , | 2932321 | 33.3% | |
| ; | 665392 | 7.6% | |
| / | 615478 | 7.0% | |
| : | 498754 | 5.7% | |
| ' | 265013 | 3.0% | |
| " | 96264 | 1.1% | |
| ? | 36122 | 0.4% | |
| % | 27468 | 0.3% | |
| & | 23309 | 0.3% | |
| # | 13918 | 0.2% | |
| @ | 5811 | 0.1% | |
| ! | 4860 | 0.1% | |
| * | 4343 | < 0.1% | |
| \ | 1070 | < 0.1% | |
| ; | 3 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 2147557 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 1625555 | 98.9% | |
| [ | 17922 | 1.1% | |
| { | 111 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 1625273 | 98.9% | |
| ] | 17891 | 1.1% | |
| } | 113 | < 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 7151 | 33.4% | |
| = | 4956 | 23.2% | |
| ~ | 4335 | 20.3% | |
| > | 3385 | 15.8% | |
| < | 1235 | 5.8% | |
| | | 345 | 1.6% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| � | 7156 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 1006 | 100.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 135 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ^ | 87 | 53.7% | |
| ` | 75 | 46.3% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| | 7 | 77.8% | |
| 2 | 22.2% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 275008335 | 76.9% | |
| Common | 82636471 | 23.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 32380982 | 11.8% | |
| t | 22374353 | 8.1% | |
| a | 21242358 | 7.7% | |
| n | 20785401 | 7.6% | |
| o | 18633914 | 6.8% | |
| i | 18472811 | 6.7% | |
| r | 15175120 | 5.5% | |
| s | 14808636 | 5.4% | |
| d | 12110904 | 4.4% | |
| h | 10009844 | 3.6% | |
| c | 9510749 | 3.5% | |
| l | 8119803 | 3.0% | |
| m | 6072443 | 2.2% | |
| u | 5861810 | 2.1% | |
| p | 5798801 | 2.1% | |
| f | 5337775 | 1.9% | |
| w | 4054996 | 1.5% | |
| g | 3779849 | 1.4% | |
| v | 3497200 | 1.3% | |
| y | 3385480 | 1.2% | |
| I | 3025121 | 1.1% | |
| b | 2821334 | 1.0% | |
| A | 2441430 | 0.9% | |
| T | 2192809 | 0.8% | |
| N | 2021837 | 0.7% | |
| Other values (27) | 21092575 | 7.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 57726515 | 69.9% | ||
| . | 3614769 | 4.4% | |
| , | 2932321 | 3.5% | |
| 1 | 2755202 | 3.3% | |
| 2 | 2635929 | 3.2% | |
| - | 2147557 | 2.6% | |
| 0 | 1668702 | 2.0% | |
| ( | 1625555 | 2.0% | |
| ) | 1625273 | 2.0% | |
| 9 | 876026 | 1.1% | |
| 3 | 711759 | 0.9% | |
| ; | 665392 | 0.8% | |
| / | 615478 | 0.7% | |
| 7 | 506925 | 0.6% | |
| : | 498754 | 0.6% | |
| 6 | 413129 | 0.5% | |
| 5 | 403076 | 0.5% | |
| 4 | 382337 | 0.5% | |
| 8 | 287679 | 0.3% | |
| ' | 265013 | 0.3% | |
| " | 96264 | 0.1% | |
| ? | 36122 | < 0.1% | |
| % | 27468 | < 0.1% | |
| & | 23309 | < 0.1% | |
| [ | 17922 | < 0.1% | |
| Other values (22) | 77995 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 357637647 | > 99.9% | |
| Specials | 7156 | < 0.1% | |
| None | 3 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 57726515 | 16.1% | ||
| e | 32380982 | 9.1% | |
| t | 22374353 | 6.3% | |
| a | 21242358 | 5.9% | |
| n | 20785401 | 5.8% | |
| o | 18633914 | 5.2% | |
| i | 18472811 | 5.2% | |
| r | 15175120 | 4.2% | |
| s | 14808636 | 4.1% | |
| d | 12110904 | 3.4% | |
| h | 10009844 | 2.8% | |
| c | 9510749 | 2.7% | |
| l | 8119803 | 2.3% | |
| m | 6072443 | 1.7% | |
| u | 5861810 | 1.6% | |
| p | 5798801 | 1.6% | |
| f | 5337775 | 1.5% | |
| w | 4054996 | 1.1% | |
| g | 3779849 | 1.1% | |
| . | 3614769 | 1.0% | |
| v | 3497200 | 1.0% | |
| y | 3385480 | 0.9% | |
| I | 3025121 | 0.8% | |
| , | 2932321 | 0.8% | |
| b | 2821334 | 0.8% | |
| Other values (72) | 46104358 | 12.9% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
| � | 7156 | 100.0% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| ; | 3 | 100.0% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 562319 |
| Missing (%) | 98.7% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 7182 | 1.3% | |
| (Missing) | 562319 | 98.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.97477792 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1124638 | 66.4% | |
| a | 562319 | 33.2% | |
| Y | 7182 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1686957 | 99.6% | |
| Uppercase Letter | 7182 | 0.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1124638 | 66.7% | |
| a | 562319 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 7182 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1694139 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1124638 | 66.4% | |
| a | 562319 | 33.2% | |
| Y | 7182 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1694139 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1124638 | 66.4% | |
| a | 562319 | 33.2% | |
| Y | 7182 | 0.4% |
| Distinct | 297 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 563051 |
| Missing (%) | 98.9% |
| Memory size | 4.3 MiB |
| 04/01/2021 | 72 |
|---|---|
| 03/05/2021 | 57 |
| 02/01/2021 | 54 |
| 02/12/2021 | 54 |
| 03/01/2021 | 53 |
| Other values (292) |
| Value | Count | Frequency (%) | |
| 04/01/2021 | 72 | < 0.1% | |
| 03/05/2021 | 57 | < 0.1% | |
| 02/01/2021 | 54 | < 0.1% | |
| 02/12/2021 | 54 | < 0.1% | |
| 03/01/2021 | 53 | < 0.1% | |
| 03/30/2021 | 51 | < 0.1% | |
| 04/06/2021 | 51 | < 0.1% | |
| 05/01/2021 | 50 | < 0.1% | |
| 03/19/2021 | 49 | < 0.1% | |
| 04/12/2021 | 49 | < 0.1% | |
| 02/13/2021 | 48 | < 0.1% | |
| 04/15/2021 | 48 | < 0.1% | |
| 02/11/2021 | 48 | < 0.1% | |
| 03/18/2021 | 48 | < 0.1% | |
| 03/12/2021 | 48 | < 0.1% | |
| 03/29/2021 | 47 | < 0.1% | |
| 04/10/2021 | 47 | < 0.1% | |
| 02/26/2021 | 47 | < 0.1% | |
| 04/03/2021 | 46 | < 0.1% | |
| 04/08/2021 | 46 | < 0.1% | |
| 03/24/2021 | 45 | < 0.1% | |
| 02/24/2021 | 45 | < 0.1% | |
| 03/13/2021 | 45 | < 0.1% | |
| 02/05/2021 | 45 | < 0.1% | |
| 02/21/2021 | 44 | < 0.1% | |
| Other values (272) | 5213 | 0.9% | |
| (Missing) | 563051 | 98.9% |
Frequencies of value counts
Unique
| Unique | 30 ? |
|---|---|
| Unique (%) | 0.5% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 3 |
| Mean length | 3.079279931 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1126102 | 64.2% | |
| a | 563051 | 32.1% | |
| 2 | 16650 | 0.9% | |
| 0 | 15466 | 0.9% | |
| / | 12900 | 0.7% | |
| 1 | 10238 | 0.6% | |
| 3 | 2213 | 0.1% | |
| 4 | 1775 | 0.1% | |
| 5 | 1446 | 0.1% | |
| 8 | 1149 | 0.1% | |
| 6 | 974 | 0.1% | |
| 7 | 922 | 0.1% | |
| 9 | 767 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1689153 | 96.3% | |
| Decimal Number | 51600 | 2.9% | |
| Other Punctuation | 12900 | 0.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1126102 | 66.7% | |
| a | 563051 | 33.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 16650 | 32.3% | |
| 0 | 15466 | 30.0% | |
| 1 | 10238 | 19.8% | |
| 3 | 2213 | 4.3% | |
| 4 | 1775 | 3.4% | |
| 5 | 1446 | 2.8% | |
| 8 | 1149 | 2.2% | |
| 6 | 974 | 1.9% | |
| 7 | 922 | 1.8% | |
| 9 | 767 | 1.5% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 12900 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1689153 | 96.3% | |
| Common | 64500 | 3.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1126102 | 66.7% | |
| a | 563051 | 33.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 16650 | 25.8% | |
| 0 | 15466 | 24.0% | |
| / | 12900 | 20.0% | |
| 1 | 10238 | 15.9% | |
| 3 | 2213 | 3.4% | |
| 4 | 1775 | 2.8% | |
| 5 | 1446 | 2.2% | |
| 8 | 1149 | 1.8% | |
| 6 | 974 | 1.5% | |
| 7 | 922 | 1.4% | |
| 9 | 767 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1753653 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1126102 | 64.2% | |
| a | 563051 | 32.1% | |
| 2 | 16650 | 0.9% | |
| 0 | 15466 | 0.9% | |
| / | 12900 | 0.7% | |
| 1 | 10238 | 0.6% | |
| 3 | 2213 | 0.1% | |
| 4 | 1775 | 0.1% | |
| 5 | 1446 | 0.1% | |
| 8 | 1149 | 0.1% | |
| 6 | 974 | 0.1% | |
| 7 | 922 | 0.1% | |
| 9 | 767 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 560732 |
| Missing (%) | 98.5% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 8769 | 1.5% | |
| (Missing) | 560732 | 98.5% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.969204619 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1121464 | 66.3% | |
| a | 560732 | 33.2% | |
| Y | 8769 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1682196 | 99.5% | |
| Uppercase Letter | 8769 | 0.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1121464 | 66.7% | |
| a | 560732 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 8769 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1690965 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1121464 | 66.3% | |
| a | 560732 | 33.2% | |
| Y | 8769 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1690965 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1121464 | 66.3% | |
| a | 560732 | 33.2% | |
| Y | 8769 | 0.5% |
| Distinct | 1 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 569449 |
| Missing (%) | > 99.9% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 52 | < 0.1% | |
| (Missing) | 569449 | > 99.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.999817384 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1138898 | 66.7% | |
| a | 569449 | 33.3% | |
| Y | 52 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1708347 | > 99.9% | |
| Uppercase Letter | 52 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1138898 | 66.7% | |
| a | 569449 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 52 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1708399 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1138898 | 66.7% | |
| a | 569449 | 33.3% | |
| Y | 52 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1708399 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1138898 | 66.7% | |
| a | 569449 | 33.3% | |
| Y | 52 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 536233 |
| Missing (%) | 94.2% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 33268 | 5.8% | |
| (Missing) | 536233 | 94.2% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.883167896 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1072466 | 65.3% | |
| a | 536233 | 32.7% | |
| Y | 33268 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1608699 | 98.0% | |
| Uppercase Letter | 33268 | 2.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1072466 | 66.7% | |
| a | 536233 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 33268 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1641967 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1072466 | 65.3% | |
| a | 536233 | 32.7% | |
| Y | 33268 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1641967 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1072466 | 65.3% | |
| a | 536233 | 32.7% | |
| Y | 33268 | 2.0% |
| Distinct | 97 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 546713 |
| Missing (%) | 96.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.09193435 |
|---|---|
| Minimum | 1 |
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 16 |
| Maximum | 99999 |
| Range | 99998 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 936.828851 |
|---|---|
| Coefficient of variation (CV) | 66.47979103 |
| Kurtosis | 11387.79309 |
| Mean | 14.09193435 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 106.7097018 |
| Sum | 321127 |
| Variance | 877648.296 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2 | 4736 | 0.8% | |
| 3 | 3895 | 0.7% | |
| 1 | 3893 | 0.7% | |
| 4 | 2549 | 0.4% | |
| 5 | 1923 | 0.3% | |
| 6 | 1090 | 0.2% | |
| 7 | 986 | 0.2% | |
| 8 | 567 | 0.1% | |
| 10 | 448 | 0.1% | |
| 9 | 379 | 0.1% | |
| 12 | 279 | < 0.1% | |
| 11 | 270 | < 0.1% | |
| 14 | 257 | < 0.1% | |
| 13 | 191 | < 0.1% | |
| 15 | 156 | < 0.1% | |
| 16 | 117 | < 0.1% | |
| 21 | 107 | < 0.1% | |
| 20 | 90 | < 0.1% | |
| 17 | 85 | < 0.1% | |
| 18 | 81 | < 0.1% | |
| 19 | 67 | < 0.1% | |
| 30 | 60 | < 0.1% | |
| 25 | 46 | < 0.1% | |
| 23 | 44 | < 0.1% | |
| 24 | 44 | < 0.1% | |
| Other values (72) | 428 | 0.1% | |
| (Missing) | 546713 | 96.0% |
| Value | Count | Frequency (%) | |
| 1 | 3893 | 0.7% | |
| 2 | 4736 | 0.8% | |
| 3 | 3895 | 0.7% | |
| 4 | 2549 | 0.4% | |
| 5 | 1923 | 0.3% | |
| 6 | 1090 | 0.2% | |
| 7 | 986 | 0.2% | |
| 8 | 567 | 0.1% | |
| 9 | 379 | 0.1% | |
| 10 | 448 | 0.1% |
| Value | Count | Frequency (%) | |
| 99999 | 2 | < 0.1% | |
| 999 | 1 | < 0.1% | |
| 731 | 1 | < 0.1% | |
| 699 | 1 | < 0.1% | |
| 180 | 1 | < 0.1% | |
| 171 | 1 | < 0.1% | |
| 150 | 1 | < 0.1% | |
| 136 | 1 | < 0.1% | |
| 132 | 1 | < 0.1% | |
| 127 | 1 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 569195 |
| Missing (%) | 99.9% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 306 | 0.1% | |
| (Missing) | 569195 | 99.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.998925375 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1138390 | 66.7% | |
| a | 569195 | 33.3% | |
| Y | 306 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1707585 | > 99.9% | |
| Uppercase Letter | 306 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1138390 | 66.7% | |
| a | 569195 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 306 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1707891 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1138390 | 66.7% | |
| a | 569195 | 33.3% | |
| Y | 306 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1707891 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1138390 | 66.7% | |
| a | 569195 | 33.3% | |
| Y | 306 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 560782 |
| Missing (%) | 98.5% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 8719 | 1.5% | |
| (Missing) | 560782 | 98.5% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.969380212 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1121564 | 66.3% | |
| a | 560782 | 33.2% | |
| Y | 8719 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1682346 | 99.5% | |
| Uppercase Letter | 8719 | 0.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1121564 | 66.7% | |
| a | 560782 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 8719 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1691065 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1121564 | 66.3% | |
| a | 560782 | 33.2% | |
| Y | 8719 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1691065 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1121564 | 66.3% | |
| a | 560782 | 33.2% | |
| Y | 8719 | 0.5% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49549 |
| Missing (%) | 8.7% |
| Memory size | 4.3 MiB |
| N | |
|---|---|
| Y | |
| U |
| Value | Count | Frequency (%) | |
| N | 198595 | 34.9% | |
| Y | 190505 | 33.5% | |
| U | 130852 | 23.0% | |
| (Missing) | 49549 | 8.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.174008474 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| N | 198595 | 29.7% | |
| Y | 190505 | 28.5% | |
| U | 130852 | 19.6% | |
| n | 99098 | 14.8% | |
| a | 49549 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 519952 | 77.8% | |
| Lowercase Letter | 148647 | 22.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 198595 | 38.2% | |
| Y | 190505 | 36.6% | |
| U | 130852 | 25.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 99098 | 66.7% | |
| a | 49549 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 668599 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| N | 198595 | 29.7% | |
| Y | 190505 | 28.5% | |
| U | 130852 | 19.6% | |
| n | 99098 | 14.8% | |
| a | 49549 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 668599 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| N | 198595 | 29.7% | |
| Y | 190505 | 28.5% | |
| U | 130852 | 19.6% | |
| n | 99098 | 14.8% | |
| a | 49549 | 7.4% |
| Distinct | 1701 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 39266 |
| Missing (%) | 6.9% |
| Memory size | 4.3 MiB |
| 04/01/2021 | 7797 |
|---|---|
| 04/08/2021 | 6402 |
| 04/07/2021 | 6116 |
| 04/09/2021 | 5958 |
| 04/06/2021 | 5436 |
| Other values (1696) |
| Value | Count | Frequency (%) | |
| 04/01/2021 | 7797 | 1.4% | |
| 04/08/2021 | 6402 | 1.1% | |
| 04/07/2021 | 6116 | 1.1% | |
| 04/09/2021 | 5958 | 1.0% | |
| 04/06/2021 | 5436 | 1.0% | |
| 01/08/2021 | 5148 | 0.9% | |
| 03/01/2021 | 5073 | 0.9% | |
| 03/31/2021 | 5051 | 0.9% | |
| 03/12/2021 | 5048 | 0.9% | |
| 01/04/2021 | 4972 | 0.9% | |
| 03/11/2021 | 4955 | 0.9% | |
| 01/07/2021 | 4842 | 0.9% | |
| 04/02/2021 | 4819 | 0.8% | |
| 01/06/2021 | 4740 | 0.8% | |
| 03/26/2021 | 4726 | 0.8% | |
| 03/25/2021 | 4677 | 0.8% | |
| 03/18/2021 | 4642 | 0.8% | |
| 01/27/2021 | 4619 | 0.8% | |
| 01/20/2021 | 4605 | 0.8% | |
| 03/04/2021 | 4602 | 0.8% | |
| 03/05/2021 | 4595 | 0.8% | |
| 03/10/2021 | 4574 | 0.8% | |
| 02/04/2021 | 4568 | 0.8% | |
| 02/01/2021 | 4553 | 0.8% | |
| 01/28/2021 | 4545 | 0.8% | |
| Other values (1676) | 403172 | 70.8% | |
| (Missing) | 39266 | 6.9% |
Frequencies of value counts
Unique
| Unique | 1001 ? |
|---|---|
| Unique (%) | 0.2% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.517363446 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 1378796 | 25.4% | |
| 0 | 1288064 | 23.8% | |
| / | 1060470 | 19.6% | |
| 1 | 868615 | 16.0% | |
| 3 | 191477 | 3.5% | |
| 4 | 149770 | 2.8% | |
| 5 | 99018 | 1.8% | |
| n | 78532 | 1.4% | |
| 6 | 76032 | 1.4% | |
| 8 | 69876 | 1.3% | |
| 7 | 63113 | 1.2% | |
| 9 | 57119 | 1.1% | |
| a | 39266 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4241880 | 78.3% | |
| Other Punctuation | 1060470 | 19.6% | |
| Lowercase Letter | 117798 | 2.2% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 1378796 | 32.5% | |
| 0 | 1288064 | 30.4% | |
| 1 | 868615 | 20.5% | |
| 3 | 191477 | 4.5% | |
| 4 | 149770 | 3.5% | |
| 5 | 99018 | 2.3% | |
| 6 | 76032 | 1.8% | |
| 8 | 69876 | 1.6% | |
| 7 | 63113 | 1.5% | |
| 9 | 57119 | 1.3% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1060470 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 78532 | 66.7% | |
| a | 39266 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5302350 | 97.8% | |
| Latin | 117798 | 2.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 1378796 | 26.0% | |
| 0 | 1288064 | 24.3% | |
| / | 1060470 | 20.0% | |
| 1 | 868615 | 16.4% | |
| 3 | 191477 | 3.6% | |
| 4 | 149770 | 2.8% | |
| 5 | 99018 | 1.9% | |
| 6 | 76032 | 1.4% | |
| 8 | 69876 | 1.3% | |
| 7 | 63113 | 1.2% | |
| 9 | 57119 | 1.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 78532 | 66.7% | |
| a | 39266 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5420148 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 1378796 | 25.4% | |
| 0 | 1288064 | 23.8% | |
| / | 1060470 | 19.6% | |
| 1 | 868615 | 16.0% | |
| 3 | 191477 | 3.5% | |
| 4 | 149770 | 2.8% | |
| 5 | 99018 | 1.8% | |
| n | 78532 | 1.4% | |
| 6 | 76032 | 1.4% | |
| 8 | 69876 | 1.3% | |
| 7 | 63113 | 1.2% | |
| 9 | 57119 | 1.1% | |
| a | 39266 | 0.7% |
| Distinct | 1126 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 44284 |
| Missing (%) | 7.8% |
| Memory size | 4.3 MiB |
| 04/01/2021 | 8744 |
|---|---|
| 04/08/2021 | 6029 |
| 03/01/2021 | 6009 |
| 04/09/2021 | 5954 |
| 04/07/2021 | 5524 |
| Other values (1121) |
| Value | Count | Frequency (%) | |
| 04/01/2021 | 8744 | 1.5% | |
| 04/08/2021 | 6029 | 1.1% | |
| 03/01/2021 | 6009 | 1.1% | |
| 04/09/2021 | 5954 | 1.0% | |
| 04/07/2021 | 5524 | 1.0% | |
| 04/10/2021 | 4868 | 0.9% | |
| 04/02/2021 | 4814 | 0.8% | |
| 04/06/2021 | 4648 | 0.8% | |
| 02/01/2021 | 4525 | 0.8% | |
| 03/12/2021 | 4458 | 0.8% | |
| 03/31/2021 | 4376 | 0.8% | |
| 03/25/2021 | 4162 | 0.7% | |
| 01/08/2021 | 4161 | 0.7% | |
| 03/11/2021 | 4159 | 0.7% | |
| 04/12/2021 | 4155 | 0.7% | |
| 03/26/2021 | 4155 | 0.7% | |
| 05/01/2021 | 4145 | 0.7% | |
| 01/07/2021 | 4058 | 0.7% | |
| 03/10/2021 | 4026 | 0.7% | |
| 04/14/2021 | 3957 | 0.7% | |
| 04/15/2021 | 3955 | 0.7% | |
| 03/18/2021 | 3953 | 0.7% | |
| 03/19/2021 | 3921 | 0.7% | |
| 03/24/2021 | 3914 | 0.7% | |
| 01/06/2021 | 3866 | 0.7% | |
| Other values (1101) | 408681 | 71.8% | |
| (Missing) | 44284 | 7.8% |
Frequencies of value counts
Unique
| Unique | 546 ? |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.455684889 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 1345081 | 25.0% | |
| 0 | 1279711 | 23.8% | |
| / | 1050434 | 19.5% | |
| 1 | 858200 | 15.9% | |
| 3 | 182251 | 3.4% | |
| 4 | 158060 | 2.9% | |
| 5 | 103952 | 1.9% | |
| n | 88568 | 1.6% | |
| 6 | 77298 | 1.4% | |
| 8 | 73174 | 1.4% | |
| 7 | 67759 | 1.3% | |
| 9 | 56250 | 1.0% | |
| a | 44284 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4201736 | 78.0% | |
| Other Punctuation | 1050434 | 19.5% | |
| Lowercase Letter | 132852 | 2.5% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 1345081 | 32.0% | |
| 0 | 1279711 | 30.5% | |
| 1 | 858200 | 20.4% | |
| 3 | 182251 | 4.3% | |
| 4 | 158060 | 3.8% | |
| 5 | 103952 | 2.5% | |
| 6 | 77298 | 1.8% | |
| 8 | 73174 | 1.7% | |
| 7 | 67759 | 1.6% | |
| 9 | 56250 | 1.3% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1050434 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 88568 | 66.7% | |
| a | 44284 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5252170 | 97.5% | |
| Latin | 132852 | 2.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 1345081 | 25.6% | |
| 0 | 1279711 | 24.4% | |
| / | 1050434 | 20.0% | |
| 1 | 858200 | 16.3% | |
| 3 | 182251 | 3.5% | |
| 4 | 158060 | 3.0% | |
| 5 | 103952 | 2.0% | |
| 6 | 77298 | 1.5% | |
| 8 | 73174 | 1.4% | |
| 7 | 67759 | 1.3% | |
| 9 | 56250 | 1.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 88568 | 66.7% | |
| a | 44284 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5385022 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 1345081 | 25.0% | |
| 0 | 1279711 | 23.8% | |
| / | 1050434 | 19.5% | |
| 1 | 858200 | 15.9% | |
| 3 | 182251 | 3.4% | |
| 4 | 158060 | 2.9% | |
| 5 | 103952 | 1.9% | |
| n | 88568 | 1.6% | |
| 6 | 77298 | 1.4% | |
| 8 | 73174 | 1.4% | |
| 7 | 67759 | 1.3% | |
| 9 | 56250 | 1.0% | |
| a | 44284 | 0.8% |
| Distinct | 826 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 66139 |
| Missing (%) | 11.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.94008884 |
|---|---|
| Minimum | 0 |
| Maximum | 44224 |
| Zeros | 221204 |
| Zeros (%) | 38.8% |
| Memory size | 4.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 35 |
| Maximum | 44224 |
| Range | 44224 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 612.6793535 |
|---|---|
| Coefficient of variation (CV) | 25.59219214 |
| Kurtosis | 2408.760857 |
| Mean | 23.94008884 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 46.71933369 |
| Sum | 12050531 |
| Variance | 375375.9902 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 221204 | 38.8% | |
| 1 | 102518 | 18.0% | |
| 2 | 27504 | 4.8% | |
| 7 | 15971 | 2.8% | |
| 3 | 15329 | 2.7% | |
| 8 | 12566 | 2.2% | |
| 4 | 10128 | 1.8% | |
| 5 | 8218 | 1.4% | |
| 6 | 8024 | 1.4% | |
| 9 | 7190 | 1.3% | |
| 10 | 5128 | 0.9% | |
| 28 | 4225 | 0.7% | |
| 14 | 4023 | 0.7% | |
| 11 | 3764 | 0.7% | |
| 12 | 3250 | 0.6% | |
| 13 | 2852 | 0.5% | |
| 29 | 2251 | 0.4% | |
| 21 | 2179 | 0.4% | |
| 15 | 2154 | 0.4% | |
| 16 | 1798 | 0.3% | |
| 17 | 1621 | 0.3% | |
| 31 | 1531 | 0.3% | |
| 19 | 1512 | 0.3% | |
| 18 | 1490 | 0.3% | |
| 20 | 1427 | 0.3% | |
| Other values (801) | 35505 | 6.2% | |
| (Missing) | 66139 | 11.6% |
| Value | Count | Frequency (%) | |
| 0 | 221204 | 38.8% | |
| 1 | 102518 | 18.0% | |
| 2 | 27504 | 4.8% | |
| 3 | 15329 | 2.7% | |
| 4 | 10128 | 1.8% | |
| 5 | 8218 | 1.4% | |
| 6 | 8024 | 1.4% | |
| 7 | 15971 | 2.8% | |
| 8 | 12566 | 2.2% | |
| 9 | 7190 | 1.3% |
| Value | Count | Frequency (%) | |
| 44224 | 1 | < 0.1% | |
| 44195 | 2 | < 0.1% | |
| 36896 | 1 | < 0.1% | |
| 36890 | 1 | < 0.1% | |
| 36697 | 1 | < 0.1% | |
| 36573 | 1 | < 0.1% | |
| 36564 | 1 | < 0.1% | |
| 36561 | 1 | < 0.1% | |
| 36555 | 1 | < 0.1% | |
| 36553 | 2 | < 0.1% |
| Distinct | 125645 |
|---|---|
| Distinct (%) | 56.3% |
| Missing | 346365 |
| Missing (%) | 60.8% |
| Memory size | 4.3 MiB |
| None | |
|---|---|
| none | |
| None. | 3130 |
| NONE | 2815 |
| no | 1921 |
| Other values (125640) |
| Value | Count | Frequency (%) | |
| None | 43056 | 7.6% | |
| none | 23993 | 4.2% | |
| None. | 3130 | 0.5% | |
| NONE | 2815 | 0.5% | |
| no | 1921 | 0.3% | |
| No | 1693 | 0.3% | |
| unknown | 1330 | 0.2% | |
| N/a | 1294 | 0.2% | |
| Unknown | 1059 | 0.2% | |
| na | 716 | 0.1% | |
| None yet | 683 | 0.1% | |
| Na | 425 | 0.1% | |
| None at this time | 268 | < 0.1% | |
| EKG | 268 | < 0.1% | |
| UNKNOWN | 267 | < 0.1% | |
| none yet | 242 | < 0.1% | |
| none. | 240 | < 0.1% | |
| NO | 203 | < 0.1% | |
| None yet. | 195 | < 0.1% | |
| see above | 185 | < 0.1% | |
| SEE ABOVE | 176 | < 0.1% | |
| See above | 173 | < 0.1% | |
| None at this time. | 160 | < 0.1% | |
| None to date | 157 | < 0.1% | |
| 0 | 153 | < 0.1% | |
| Other values (125620) | 138334 | 24.3% | |
| (Missing) | 346365 | 60.8% |
Frequencies of value counts
Unique
| Unique | 122560 ? |
|---|---|
| Unique (%) | 54.9% |
Histogram of lengths of the category
Length
| Max length | 32000 |
|---|---|
| Median length | 3 |
| Mean length | 35.863484 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 3177003 | 15.6% | ||
| e | 1635913 | 8.0% | |
| n | 1558105 | 7.6% | |
| a | 1364006 | 6.7% | |
| t | 1325407 | 6.5% | |
| s | 915067 | 4.5% | |
| o | 900999 | 4.4% | |
| r | 730773 | 3.6% | |
| i | 706283 | 3.5% | |
| l | 566185 | 2.8% | |
| d | 475986 | 2.3% | |
| u | 456778 | 2.2% | |
| c | 365837 | 1.8% | |
| m | 354954 | 1.7% | |
| 2 | 303622 | 1.5% | |
| h | 299639 | 1.5% | |
| 1 | 279691 | 1.4% | |
| : | 266256 | 1.3% | |
| 0 | 253003 | 1.2% | |
| T | 246441 | 1.2% | |
| p | 219310 | 1.1% | |
| N | 217693 | 1.1% | |
| g | 196883 | 1.0% | |
| R | 189483 | 0.9% | |
| . | 185736 | 0.9% | |
| Other values (72) | 3233237 | 15.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 12947337 | 63.4% | |
| Space Separator | 3177003 | 15.6% | |
| Uppercase Letter | 1969317 | 9.6% | |
| Decimal Number | 1225687 | 6.0% | |
| Other Punctuation | 926953 | 4.5% | |
| Dash Punctuation | 100555 | 0.5% | |
| Close Punctuation | 30998 | 0.2% | |
| Open Punctuation | 30992 | 0.2% | |
| Math Symbol | 11905 | 0.1% | |
| Connector Punctuation | 2373 | < 0.1% | |
| Other Symbol | 855 | < 0.1% | |
| Modifier Symbol | 271 | < 0.1% | |
| Currency Symbol | 43 | < 0.1% | |
| Other Number | 1 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| T | 246441 | 12.5% | |
| N | 217693 | 11.1% | |
| R | 189483 | 9.6% | |
| D | 157357 | 8.0% | |
| C | 153702 | 7.8% | |
| I | 108891 | 5.5% | |
| E | 93819 | 4.8% | |
| A | 89546 | 4.5% | |
| U | 86833 | 4.4% | |
| S | 82377 | 4.2% | |
| P | 81410 | 4.1% | |
| O | 73566 | 3.7% | |
| B | 60209 | 3.1% | |
| M | 59726 | 3.0% | |
| L | 57483 | 2.9% | |
| V | 45186 | 2.3% | |
| H | 42800 | 2.2% | |
| G | 35265 | 1.8% | |
| F | 28427 | 1.4% | |
| K | 17073 | 0.9% | |
| W | 15268 | 0.8% | |
| X | 10410 | 0.5% | |
| Y | 7103 | 0.4% | |
| J | 5782 | 0.3% | |
| Q | 2474 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 1635913 | 12.6% | |
| n | 1558105 | 12.0% | |
| a | 1364006 | 10.5% | |
| t | 1325407 | 10.2% | |
| s | 915067 | 7.1% | |
| o | 900999 | 7.0% | |
| r | 730773 | 5.6% | |
| i | 706283 | 5.5% | |
| l | 566185 | 4.4% | |
| d | 475986 | 3.7% | |
| u | 456778 | 3.5% | |
| c | 365837 | 2.8% | |
| m | 354954 | 2.7% | |
| h | 299639 | 2.3% | |
| p | 219310 | 1.7% | |
| g | 196883 | 1.5% | |
| y | 166791 | 1.3% | |
| v | 155745 | 1.2% | |
| f | 152401 | 1.2% | |
| w | 145769 | 1.1% | |
| b | 142403 | 1.1% | |
| k | 64584 | 0.5% | |
| x | 29578 | 0.2% | |
| z | 8519 | 0.1% | |
| j | 5260 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 303622 | 24.8% | |
| 1 | 279691 | 22.8% | |
| 0 | 253003 | 20.6% | |
| 3 | 73484 | 6.0% | |
| 4 | 65194 | 5.3% | |
| 9 | 60099 | 4.9% | |
| 5 | 58516 | 4.8% | |
| 6 | 45518 | 3.7% | |
| 8 | 45242 | 3.7% | |
| 7 | 41318 | 3.4% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| : | 266256 | 28.7% | |
| . | 185736 | 20.0% | |
| ; | 159944 | 17.3% | |
| / | 155708 | 16.8% | |
| , | 127258 | 13.7% | |
| % | 8911 | 1.0% | |
| ' | 7167 | 0.8% | |
| ? | 5076 | 0.5% | |
| * | 3514 | 0.4% | |
| " | 2466 | 0.3% | |
| & | 2302 | 0.2% | |
| # | 971 | 0.1% | |
| @ | 890 | 0.1% | |
| ! | 701 | 0.1% | |
| \ | 53 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 3177003 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 100555 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 29047 | 93.7% | |
| [ | 1502 | 4.8% | |
| { | 443 | 1.4% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 29060 | 93.7% | |
| ] | 1493 | 4.8% | |
| } | 445 | 1.4% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| = | 3351 | 28.1% | |
| + | 2246 | 18.9% | |
| | | 2194 | 18.4% | |
| < | 2063 | 17.3% | |
| > | 1771 | 14.9% | |
| ~ | 280 | 2.4% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| � | 855 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 2373 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ^ | 260 | 95.9% | |
| ` | 11 | 4.1% |
Most frequent Other Number characters
| Value | Count | Frequency (%) | |
| ² | 1 | 100.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 43 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 14916654 | 73.0% | |
| Common | 5507636 | 27.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 1635913 | 11.0% | |
| n | 1558105 | 10.4% | |
| a | 1364006 | 9.1% | |
| t | 1325407 | 8.9% | |
| s | 915067 | 6.1% | |
| o | 900999 | 6.0% | |
| r | 730773 | 4.9% | |
| i | 706283 | 4.7% | |
| l | 566185 | 3.8% | |
| d | 475986 | 3.2% | |
| u | 456778 | 3.1% | |
| c | 365837 | 2.5% | |
| m | 354954 | 2.4% | |
| h | 299639 | 2.0% | |
| T | 246441 | 1.7% | |
| p | 219310 | 1.5% | |
| N | 217693 | 1.5% | |
| g | 196883 | 1.3% | |
| R | 189483 | 1.3% | |
| y | 166791 | 1.1% | |
| D | 157357 | 1.1% | |
| v | 155745 | 1.0% | |
| C | 153702 | 1.0% | |
| f | 152401 | 1.0% | |
| w | 145769 | 1.0% | |
| Other values (27) | 1259147 | 8.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 3177003 | 57.7% | ||
| 2 | 303622 | 5.5% | |
| 1 | 279691 | 5.1% | |
| : | 266256 | 4.8% | |
| 0 | 253003 | 4.6% | |
| . | 185736 | 3.4% | |
| ; | 159944 | 2.9% | |
| / | 155708 | 2.8% | |
| , | 127258 | 2.3% | |
| - | 100555 | 1.8% | |
| 3 | 73484 | 1.3% | |
| 4 | 65194 | 1.2% | |
| 9 | 60099 | 1.1% | |
| 5 | 58516 | 1.1% | |
| 6 | 45518 | 0.8% | |
| 8 | 45242 | 0.8% | |
| 7 | 41318 | 0.8% | |
| ) | 29060 | 0.5% | |
| ( | 29047 | 0.5% | |
| % | 8911 | 0.2% | |
| ' | 7167 | 0.1% | |
| ? | 5076 | 0.1% | |
| * | 3514 | 0.1% | |
| = | 3351 | 0.1% | |
| " | 2466 | < 0.1% | |
| Other values (20) | 20897 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 20423434 | > 99.9% | |
| Specials | 855 | < 0.1% | |
| Latin 1 Sup | 1 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 3177003 | 15.6% | ||
| e | 1635913 | 8.0% | |
| n | 1558105 | 7.6% | |
| a | 1364006 | 6.7% | |
| t | 1325407 | 6.5% | |
| s | 915067 | 4.5% | |
| o | 900999 | 4.4% | |
| r | 730773 | 3.6% | |
| i | 706283 | 3.5% | |
| l | 566185 | 2.8% | |
| d | 475986 | 2.3% | |
| u | 456778 | 2.2% | |
| c | 365837 | 1.8% | |
| m | 354954 | 1.7% | |
| 2 | 303622 | 1.5% | |
| h | 299639 | 1.5% | |
| 1 | 279691 | 1.4% | |
| : | 266256 | 1.3% | |
| 0 | 253003 | 1.2% | |
| T | 246441 | 1.2% | |
| p | 219310 | 1.1% | |
| N | 217693 | 1.1% | |
| g | 196883 | 1.0% | |
| R | 189483 | 0.9% | |
| . | 185736 | 0.9% | |
| Other values (70) | 3232381 | 15.8% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
| � | 855 | 100.0% |
Most frequent Latin 1 Sup characters
| Value | Count | Frequency (%) | |
| ² | 1 | 100.0% |
V_ADMINBY
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.3 MiB |
| UNK | |
|---|---|
| PVT | |
| PHM | |
| OTH | |
| PUB | |
| Other values (4) |
| Value | Count | Frequency (%) | |
| UNK | 162869 | 28.6% | |
| PVT | 122742 | 21.6% | |
| PHM | 100348 | 17.6% | |
| OTH | 82036 | 14.4% | |
| PUB | 58015 | 10.2% | |
| WRK | 19167 | 3.4% | |
| SCH | 8742 | 1.5% | |
| SEN | 8661 | 1.5% | |
| MIL | 6921 | 1.2% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| P | 281105 | 16.5% | |
| U | 220884 | 12.9% | |
| T | 204778 | 12.0% | |
| H | 191126 | 11.2% | |
| K | 182036 | 10.7% | |
| N | 171530 | 10.0% | |
| V | 122742 | 7.2% | |
| M | 107269 | 6.3% | |
| O | 82036 | 4.8% | |
| B | 58015 | 3.4% | |
| W | 19167 | 1.1% | |
| R | 19167 | 1.1% | |
| S | 17403 | 1.0% | |
| C | 8742 | 0.5% | |
| E | 8661 | 0.5% | |
| I | 6921 | 0.4% | |
| L | 6921 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 1708503 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| P | 281105 | 16.5% | |
| U | 220884 | 12.9% | |
| T | 204778 | 12.0% | |
| H | 191126 | 11.2% | |
| K | 182036 | 10.7% | |
| N | 171530 | 10.0% | |
| V | 122742 | 7.2% | |
| M | 107269 | 6.3% | |
| O | 82036 | 4.8% | |
| B | 58015 | 3.4% | |
| W | 19167 | 1.1% | |
| R | 19167 | 1.1% | |
| S | 17403 | 1.0% | |
| C | 8742 | 0.5% | |
| E | 8661 | 0.5% | |
| I | 6921 | 0.4% | |
| L | 6921 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1708503 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| P | 281105 | 16.5% | |
| U | 220884 | 12.9% | |
| T | 204778 | 12.0% | |
| H | 191126 | 11.2% | |
| K | 182036 | 10.7% | |
| N | 171530 | 10.0% | |
| V | 122742 | 7.2% | |
| M | 107269 | 6.3% | |
| O | 82036 | 4.8% | |
| B | 58015 | 3.4% | |
| W | 19167 | 1.1% | |
| R | 19167 | 1.1% | |
| S | 17403 | 1.0% | |
| C | 8742 | 0.5% | |
| E | 8661 | 0.5% | |
| I | 6921 | 0.4% | |
| L | 6921 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1708503 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| P | 281105 | 16.5% | |
| U | 220884 | 12.9% | |
| T | 204778 | 12.0% | |
| H | 191126 | 11.2% | |
| K | 182036 | 10.7% | |
| N | 171530 | 10.0% | |
| V | 122742 | 7.2% | |
| M | 107269 | 6.3% | |
| O | 82036 | 4.8% | |
| B | 58015 | 3.4% | |
| W | 19167 | 1.1% | |
| R | 19167 | 1.1% | |
| S | 17403 | 1.0% | |
| C | 8742 | 0.5% | |
| E | 8661 | 0.5% | |
| I | 6921 | 0.4% | |
| L | 6921 | 0.4% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 569108 |
| Missing (%) | 99.9% |
| Memory size | 4.3 MiB |
| OTH | |
|---|---|
| UNK | |
| PUB | |
| PVT | |
| MIL | 4 |
| Value | Count | Frequency (%) | |
| OTH | 190 | < 0.1% | |
| UNK | 83 | < 0.1% | |
| PUB | 71 | < 0.1% | |
| PVT | 45 | < 0.1% | |
| MIL | 4 | < 0.1% | |
| (Missing) | 569108 | 99.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1138216 | 66.6% | |
| a | 569108 | 33.3% | |
| T | 235 | < 0.1% | |
| O | 190 | < 0.1% | |
| H | 190 | < 0.1% | |
| U | 154 | < 0.1% | |
| P | 116 | < 0.1% | |
| N | 83 | < 0.1% | |
| K | 83 | < 0.1% | |
| B | 71 | < 0.1% | |
| V | 45 | < 0.1% | |
| M | 4 | < 0.1% | |
| I | 4 | < 0.1% | |
| L | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1707324 | 99.9% | |
| Uppercase Letter | 1179 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1138216 | 66.7% | |
| a | 569108 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| T | 235 | 19.9% | |
| O | 190 | 16.1% | |
| H | 190 | 16.1% | |
| U | 154 | 13.1% | |
| P | 116 | 9.8% | |
| N | 83 | 7.0% | |
| K | 83 | 7.0% | |
| B | 71 | 6.0% | |
| V | 45 | 3.8% | |
| M | 4 | 0.3% | |
| I | 4 | 0.3% | |
| L | 4 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1708503 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1138216 | 66.6% | |
| a | 569108 | 33.3% | |
| T | 235 | < 0.1% | |
| O | 190 | < 0.1% | |
| H | 190 | < 0.1% | |
| U | 154 | < 0.1% | |
| P | 116 | < 0.1% | |
| N | 83 | < 0.1% | |
| K | 83 | < 0.1% | |
| B | 71 | < 0.1% | |
| V | 45 | < 0.1% | |
| M | 4 | < 0.1% | |
| I | 4 | < 0.1% | |
| L | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1708503 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1138216 | 66.6% | |
| a | 569108 | 33.3% | |
| T | 235 | < 0.1% | |
| O | 190 | < 0.1% | |
| H | 190 | < 0.1% | |
| U | 154 | < 0.1% | |
| P | 116 | < 0.1% | |
| N | 83 | < 0.1% | |
| K | 83 | < 0.1% | |
| B | 71 | < 0.1% | |
| V | 45 | < 0.1% | |
| M | 4 | < 0.1% | |
| I | 4 | < 0.1% | |
| L | 4 | < 0.1% |
| Distinct | 218020 |
|---|---|
| Distinct (%) | 65.0% |
| Missing | 234118 |
| Missing (%) | 41.1% |
| Memory size | 4.3 MiB |
| None | |
|---|---|
| none | 14425 |
| unknown | 6127 |
| Unknown | 4979 |
| No | 2547 |
| Other values (218015) |
| Value | Count | Frequency (%) | |
| None | 32076 | 5.6% | |
| none | 14425 | 2.5% | |
| unknown | 6127 | 1.1% | |
| Unknown | 4979 | 0.9% | |
| No | 2547 | 0.4% | |
| NONE | 2276 | 0.4% | |
| no | 1593 | 0.3% | |
| UNKNOWN | 1293 | 0.2% | |
| None. | 1144 | 0.2% | |
| Tylenol | 771 | 0.1% | |
| N/a | 703 | 0.1% | |
| Levothyroxine | 688 | 0.1% | |
| Multivitamin | 640 | 0.1% | |
| Birth control | 628 | 0.1% | |
| Synthroid | 523 | 0.1% | |
| Zyrtec | 511 | 0.1% | |
| None reported | 486 | 0.1% | |
| Vitamin D | 475 | 0.1% | |
| Ibuprofen | 446 | 0.1% | |
| LEVOTHYROXINE | 432 | 0.1% | |
| na | 378 | 0.1% | |
| none known | 374 | 0.1% | |
| SYNTHROID | 374 | 0.1% | |
| TYLENOL | 370 | 0.1% | |
| none reported | 357 | 0.1% | |
| Other values (217995) | 260767 | 45.8% | |
| (Missing) | 234118 | 41.1% |
Frequencies of value counts
Unique
| Unique | 210441 ? |
|---|---|
| Unique (%) | 62.7% |
Histogram of lengths of the category
Length
| Max length | 240 |
|---|---|
| Median length | 4 |
| Mean length | 29.98529063 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1980367 | 11.6% | ||
| n | 1341324 | 7.9% | |
| a | 1151838 | 6.7% | |
| i | 1117432 | 6.5% | |
| e | 918269 | 5.4% | |
| o | 841754 | 4.9% | |
| t | 781859 | 4.6% | |
| l | 659814 | 3.9% | |
| r | 647225 | 3.8% | |
| m | 547425 | 3.2% | |
| , | 480146 | 2.8% | |
| s | 383811 | 2.2% | |
| p | 292684 | 1.7% | |
| d | 292085 | 1.7% | |
| c | 290933 | 1.7% | |
| u | 252365 | 1.5% | |
| A | 251860 | 1.5% | |
| g | 232202 | 1.4% | |
| I | 226753 | 1.3% | |
| y | 209609 | 1.2% | |
| 0 | 206679 | 1.2% | |
| N | 205321 | 1.2% | |
| O | 194623 | 1.1% | |
| L | 194553 | 1.1% | |
| T | 179255 | 1.0% | |
| Other values (73) | 3196467 | 18.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 10805255 | 63.3% | |
| Uppercase Letter | 2813788 | 16.5% | |
| Space Separator | 1980367 | 11.6% | |
| Other Punctuation | 719982 | 4.2% | |
| Decimal Number | 598004 | 3.5% | |
| Dash Punctuation | 54196 | 0.3% | |
| Open Punctuation | 49994 | 0.3% | |
| Close Punctuation | 49025 | 0.3% | |
| Math Symbol | 5571 | < 0.1% | |
| Other Symbol | 240 | < 0.1% | |
| Connector Punctuation | 201 | < 0.1% | |
| Modifier Symbol | 19 | < 0.1% | |
| Currency Symbol | 9 | < 0.1% | |
| Control | 2 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 251860 | 9.0% | |
| I | 226753 | 8.1% | |
| N | 205321 | 7.3% | |
| O | 194623 | 6.9% | |
| L | 194553 | 6.9% | |
| T | 179255 | 6.4% | |
| E | 177824 | 6.3% | |
| C | 160885 | 5.7% | |
| M | 159088 | 5.7% | |
| R | 146815 | 5.2% | |
| D | 136586 | 4.9% | |
| S | 123720 | 4.4% | |
| P | 114456 | 4.1% | |
| V | 104217 | 3.7% | |
| B | 77758 | 2.8% | |
| H | 57989 | 2.1% | |
| F | 53996 | 1.9% | |
| U | 51528 | 1.8% | |
| Z | 44700 | 1.6% | |
| G | 44051 | 1.6% | |
| Y | 35692 | 1.3% | |
| X | 30348 | 1.1% | |
| Q | 13886 | 0.5% | |
| W | 12924 | 0.5% | |
| K | 10430 | 0.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1341324 | 12.4% | |
| a | 1151838 | 10.7% | |
| i | 1117432 | 10.3% | |
| e | 918269 | 8.5% | |
| o | 841754 | 7.8% | |
| t | 781859 | 7.2% | |
| l | 659814 | 6.1% | |
| r | 647225 | 6.0% | |
| m | 547425 | 5.1% | |
| s | 383811 | 3.6% | |
| p | 292684 | 2.7% | |
| d | 292085 | 2.7% | |
| c | 290933 | 2.7% | |
| u | 252365 | 2.3% | |
| g | 232202 | 2.1% | |
| y | 209609 | 1.9% | |
| v | 167743 | 1.6% | |
| h | 160087 | 1.5% | |
| b | 133773 | 1.2% | |
| x | 105049 | 1.0% | |
| f | 96710 | 0.9% | |
| z | 78502 | 0.7% | |
| w | 41324 | 0.4% | |
| k | 40913 | 0.4% | |
| q | 15785 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1980367 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 480146 | 66.7% | |
| ; | 122002 | 16.9% | |
| . | 61324 | 8.5% | |
| / | 32122 | 4.5% | |
| : | 6859 | 1.0% | |
| ' | 4173 | 0.6% | |
| ? | 4119 | 0.6% | |
| & | 3461 | 0.5% | |
| % | 3345 | 0.5% | |
| * | 806 | 0.1% | |
| " | 773 | 0.1% | |
| @ | 365 | 0.1% | |
| # | 319 | < 0.1% | |
| ! | 132 | < 0.1% | |
| \ | 35 | < 0.1% | |
| ; | 1 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 206679 | 34.6% | |
| 1 | 115435 | 19.3% | |
| 2 | 88085 | 14.7% | |
| 5 | 79418 | 13.3% | |
| 3 | 44017 | 7.4% | |
| 4 | 22455 | 3.8% | |
| 8 | 17397 | 2.9% | |
| 6 | 11391 | 1.9% | |
| 7 | 9232 | 1.5% | |
| 9 | 3895 | 0.7% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 54196 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 35168 | 70.3% | |
| [ | 14808 | 29.6% | |
| { | 18 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 34821 | 71.0% | |
| ] | 14190 | 28.9% | |
| } | 14 | < 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 3459 | 62.1% | |
| = | 1823 | 32.7% | |
| ~ | 107 | 1.9% | |
| > | 78 | 1.4% | |
| | | 77 | 1.4% | |
| < | 27 | 0.5% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| � | 240 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 15 | 78.9% | |
| ^ | 4 | 21.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 201 | 100.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 9 | 100.0% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 13619043 | 79.8% | |
| Common | 3457610 | 20.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1341324 | 9.8% | |
| a | 1151838 | 8.5% | |
| i | 1117432 | 8.2% | |
| e | 918269 | 6.7% | |
| o | 841754 | 6.2% | |
| t | 781859 | 5.7% | |
| l | 659814 | 4.8% | |
| r | 647225 | 4.8% | |
| m | 547425 | 4.0% | |
| s | 383811 | 2.8% | |
| p | 292684 | 2.1% | |
| d | 292085 | 2.1% | |
| c | 290933 | 2.1% | |
| u | 252365 | 1.9% | |
| A | 251860 | 1.8% | |
| g | 232202 | 1.7% | |
| I | 226753 | 1.7% | |
| y | 209609 | 1.5% | |
| N | 205321 | 1.5% | |
| O | 194623 | 1.4% | |
| L | 194553 | 1.4% | |
| T | 179255 | 1.3% | |
| E | 177824 | 1.3% | |
| v | 167743 | 1.2% | |
| C | 160885 | 1.2% | |
| Other values (27) | 1899597 | 13.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1980367 | 57.3% | ||
| , | 480146 | 13.9% | |
| 0 | 206679 | 6.0% | |
| ; | 122002 | 3.5% | |
| 1 | 115435 | 3.3% | |
| 2 | 88085 | 2.5% | |
| 5 | 79418 | 2.3% | |
| . | 61324 | 1.8% | |
| - | 54196 | 1.6% | |
| 3 | 44017 | 1.3% | |
| ( | 35168 | 1.0% | |
| ) | 34821 | 1.0% | |
| / | 32122 | 0.9% | |
| 4 | 22455 | 0.6% | |
| 8 | 17397 | 0.5% | |
| [ | 14808 | 0.4% | |
| ] | 14190 | 0.4% | |
| 6 | 11391 | 0.3% | |
| 7 | 9232 | 0.3% | |
| : | 6859 | 0.2% | |
| ' | 4173 | 0.1% | |
| ? | 4119 | 0.1% | |
| 9 | 3895 | 0.1% | |
| & | 3461 | 0.1% | |
| + | 3459 | 0.1% | |
| Other values (21) | 8391 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 17076412 | > 99.9% | |
| Specials | 240 | < 0.1% | |
| None | 1 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1980367 | 11.6% | ||
| n | 1341324 | 7.9% | |
| a | 1151838 | 6.7% | |
| i | 1117432 | 6.5% | |
| e | 918269 | 5.4% | |
| o | 841754 | 4.9% | |
| t | 781859 | 4.6% | |
| l | 659814 | 3.9% | |
| r | 647225 | 3.8% | |
| m | 547425 | 3.2% | |
| , | 480146 | 2.8% | |
| s | 383811 | 2.2% | |
| p | 292684 | 1.7% | |
| d | 292085 | 1.7% | |
| c | 290933 | 1.7% | |
| u | 252365 | 1.5% | |
| A | 251860 | 1.5% | |
| g | 232202 | 1.4% | |
| I | 226753 | 1.3% | |
| y | 209609 | 1.2% | |
| 0 | 206679 | 1.2% | |
| N | 205321 | 1.2% | |
| O | 194623 | 1.1% | |
| L | 194553 | 1.1% | |
| T | 179255 | 1.0% | |
| Other values (71) | 3196226 | 18.7% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
| � | 240 | 100.0% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| ; | 1 | 100.0% |
| Distinct | 51645 |
|---|---|
| Distinct (%) | 19.5% |
| Missing | 304713 |
| Missing (%) | 53.5% |
| Memory size | 4.3 MiB |
| None | |
|---|---|
| none | |
| No | 9841 |
| no | 6366 |
| NONE | 5423 |
| Other values (51640) |
| Value | Count | Frequency (%) | |
| None | 96724 | 17.0% | |
| none | 46819 | 8.2% | |
| No | 9841 | 1.7% | |
| no | 6366 | 1.1% | |
| NONE | 5423 | 1.0% | |
| unknown | 4740 | 0.8% | |
| Unknown | 3740 | 0.7% | |
| None. | 3542 | 0.6% | |
| N/a | 1685 | 0.3% | |
| None known | 1038 | 0.2% | |
| None reported | 1026 | 0.2% | |
| UNKNOWN | 989 | 0.2% | |
| none known | 938 | 0.2% | |
| none reported | 860 | 0.2% | |
| NO | 850 | 0.1% | |
| Asthma | 706 | 0.1% | |
| na | 635 | 0.1% | |
| Hypertension | 607 | 0.1% | |
| Diabetes | 532 | 0.1% | |
| Na | 511 | 0.1% | |
| Seasonal allergies | 419 | 0.1% | |
| No. | 358 | 0.1% | |
| UTI | 321 | 0.1% | |
| 0 | 319 | 0.1% | |
| Abstains from alcohol; Non-smoker | 316 | 0.1% | |
| Other values (51620) | 75483 | 13.3% | |
| (Missing) | 304713 | 53.5% |
Frequencies of value counts
Unique
| Unique | 48436 ? |
|---|---|
| Unique (%) | 18.3% |
Histogram of lengths of the category
Length
| Max length | 4668 |
|---|---|
| Median length | 3 |
| Mean length | 9.865250456 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1052484 | 18.7% | |
| a | 539270 | 9.6% | |
| 523906 | 9.3% | ||
| e | 497118 | 8.8% | |
| o | 409518 | 7.3% | |
| i | 257739 | 4.6% | |
| t | 219354 | 3.9% | |
| s | 205934 | 3.7% | |
| r | 205596 | 3.7% | |
| N | 152601 | 2.7% | |
| l | 143832 | 2.6% | |
| d | 112016 | 2.0% | |
| c | 99250 | 1.8% | |
| h | 98851 | 1.8% | |
| m | 71117 | 1.3% | |
| u | 70407 | 1.3% | |
| y | 69414 | 1.2% | |
| p | 68809 | 1.2% | |
| g | 64590 | 1.1% | |
| f | 44437 | 0.8% | |
| b | 37608 | 0.7% | |
| w | 36730 | 0.7% | |
| v | 33893 | 0.6% | |
| , | 32460 | 0.6% | |
| k | 29373 | 0.5% | |
| Other values (72) | 541963 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 4382525 | 78.0% | |
| Space Separator | 523906 | 9.3% | |
| Uppercase Letter | 455857 | 8.1% | |
| Other Punctuation | 113201 | 2.0% | |
| Decimal Number | 93852 | 1.7% | |
| Open Punctuation | 18431 | 0.3% | |
| Close Punctuation | 18415 | 0.3% | |
| Dash Punctuation | 11378 | 0.2% | |
| Math Symbol | 563 | < 0.1% | |
| Other Symbol | 66 | < 0.1% | |
| Connector Punctuation | 41 | < 0.1% | |
| Control | 31 | < 0.1% | |
| Modifier Symbol | 3 | < 0.1% | |
| Currency Symbol | 1 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 152601 | 33.5% | |
| A | 28182 | 6.2% | |
| D | 25449 | 5.6% | |
| O | 24695 | 5.4% | |
| I | 23857 | 5.2% | |
| C | 22951 | 5.0% | |
| E | 22355 | 4.9% | |
| S | 20974 | 4.6% | |
| H | 18716 | 4.1% | |
| P | 16386 | 3.6% | |
| T | 16128 | 3.5% | |
| R | 13469 | 3.0% | |
| M | 10588 | 2.3% | |
| U | 10331 | 2.3% | |
| B | 9135 | 2.0% | |
| L | 8692 | 1.9% | |
| V | 7777 | 1.7% | |
| F | 6847 | 1.5% | |
| G | 5187 | 1.1% | |
| K | 3357 | 0.7% | |
| W | 2905 | 0.6% | |
| Y | 2299 | 0.5% | |
| J | 1743 | 0.4% | |
| X | 595 | 0.1% | |
| Z | 459 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1052484 | 24.0% | |
| a | 539270 | 12.3% | |
| e | 497118 | 11.3% | |
| o | 409518 | 9.3% | |
| i | 257739 | 5.9% | |
| t | 219354 | 5.0% | |
| s | 205934 | 4.7% | |
| r | 205596 | 4.7% | |
| l | 143832 | 3.3% | |
| d | 112016 | 2.6% | |
| c | 99250 | 2.3% | |
| h | 98851 | 2.3% | |
| m | 71117 | 1.6% | |
| u | 70407 | 1.6% | |
| y | 69414 | 1.6% | |
| p | 68809 | 1.6% | |
| g | 64590 | 1.5% | |
| f | 44437 | 1.0% | |
| b | 37608 | 0.9% | |
| w | 36730 | 0.8% | |
| v | 33893 | 0.8% | |
| k | 29373 | 0.7% | |
| x | 8505 | 0.2% | |
| z | 3522 | 0.1% | |
| j | 2175 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 523906 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 32460 | 28.7% | |
| . | 29049 | 25.7% | |
| ; | 24820 | 21.9% | |
| / | 18328 | 16.2% | |
| ' | 2464 | 2.2% | |
| : | 2171 | 1.9% | |
| ? | 2120 | 1.9% | |
| " | 629 | 0.6% | |
| & | 464 | 0.4% | |
| # | 208 | 0.2% | |
| * | 158 | 0.1% | |
| % | 157 | 0.1% | |
| ! | 124 | 0.1% | |
| @ | 25 | < 0.1% | |
| \ | 24 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 11378 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 27008 | 28.8% | |
| 1 | 23304 | 24.8% | |
| 0 | 16975 | 18.1% | |
| 9 | 6059 | 6.5% | |
| 3 | 5260 | 5.6% | |
| 4 | 3865 | 4.1% | |
| 5 | 3652 | 3.9% | |
| 8 | 2678 | 2.9% | |
| 6 | 2656 | 2.8% | |
| 7 | 2395 | 2.6% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 18326 | 99.4% | |
| [ | 100 | 0.5% | |
| { | 5 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 18312 | 99.4% | |
| ] | 100 | 0.5% | |
| } | 3 | < 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 343 | 60.9% | |
| > | 66 | 11.7% | |
| ~ | 59 | 10.5% | |
| = | 45 | 8.0% | |
| | | 28 | 5.0% | |
| < | 22 | 3.9% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| � | 66 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 2 | 66.7% | |
| ^ | 1 | 33.3% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 1 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 41 | 100.0% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 31 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 4838382 | 86.1% | |
| Common | 779888 | 13.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1052484 | 21.8% | |
| a | 539270 | 11.1% | |
| e | 497118 | 10.3% | |
| o | 409518 | 8.5% | |
| i | 257739 | 5.3% | |
| t | 219354 | 4.5% | |
| s | 205934 | 4.3% | |
| r | 205596 | 4.2% | |
| N | 152601 | 3.2% | |
| l | 143832 | 3.0% | |
| d | 112016 | 2.3% | |
| c | 99250 | 2.1% | |
| h | 98851 | 2.0% | |
| m | 71117 | 1.5% | |
| u | 70407 | 1.5% | |
| y | 69414 | 1.4% | |
| p | 68809 | 1.4% | |
| g | 64590 | 1.3% | |
| f | 44437 | 0.9% | |
| b | 37608 | 0.8% | |
| w | 36730 | 0.8% | |
| v | 33893 | 0.7% | |
| k | 29373 | 0.6% | |
| A | 28182 | 0.6% | |
| D | 25449 | 0.5% | |
| Other values (27) | 264810 | 5.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 523906 | 67.2% | ||
| , | 32460 | 4.2% | |
| . | 29049 | 3.7% | |
| 2 | 27008 | 3.5% | |
| ; | 24820 | 3.2% | |
| 1 | 23304 | 3.0% | |
| / | 18328 | 2.4% | |
| ( | 18326 | 2.3% | |
| ) | 18312 | 2.3% | |
| 0 | 16975 | 2.2% | |
| - | 11378 | 1.5% | |
| 9 | 6059 | 0.8% | |
| 3 | 5260 | 0.7% | |
| 4 | 3865 | 0.5% | |
| 5 | 3652 | 0.5% | |
| 8 | 2678 | 0.3% | |
| 6 | 2656 | 0.3% | |
| ' | 2464 | 0.3% | |
| 7 | 2395 | 0.3% | |
| : | 2171 | 0.3% | |
| ? | 2120 | 0.3% | |
| " | 629 | 0.1% | |
| & | 464 | 0.1% | |
| + | 343 | < 0.1% | |
| # | 208 | < 0.1% | |
| Other values (20) | 1058 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5618204 | > 99.9% | |
| Specials | 66 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1052484 | 18.7% | |
| a | 539270 | 9.6% | |
| 523906 | 9.3% | ||
| e | 497118 | 8.8% | |
| o | 409518 | 7.3% | |
| i | 257739 | 4.6% | |
| t | 219354 | 3.9% | |
| s | 205934 | 3.7% | |
| r | 205596 | 3.7% | |
| N | 152601 | 2.7% | |
| l | 143832 | 2.6% | |
| d | 112016 | 2.0% | |
| c | 99250 | 1.8% | |
| h | 98851 | 1.8% | |
| m | 71117 | 1.3% | |
| u | 70407 | 1.3% | |
| y | 69414 | 1.2% | |
| p | 68809 | 1.2% | |
| g | 64590 | 1.1% | |
| f | 44437 | 0.8% | |
| b | 37608 | 0.7% | |
| w | 36730 | 0.7% | |
| v | 33893 | 0.6% | |
| , | 32460 | 0.6% | |
| k | 29373 | 0.5% | |
| Other values (71) | 541897 | 9.6% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
| � | 66 | 100.0% |
| Distinct | 146407 |
|---|---|
| Distinct (%) | 40.9% |
| Missing | 211582 |
| Missing (%) | 37.2% |
| Memory size | 4.3 MiB |
| None | |
|---|---|
| none | 23493 |
| Comments: Unknown | 6979 |
| No | 5681 |
| Asthma | 5171 |
| Other values (146402) |
| Value | Count | Frequency (%) | |
| None | 50414 | 8.9% | |
| none | 23493 | 4.1% | |
| Comments: Unknown | 6979 | 1.2% | |
| No | 5681 | 1.0% | |
| Asthma | 5171 | 0.9% | |
| Medical History/Concurrent Conditions: No adverse event | 4130 | 0.7% | |
| Comments: List of non-encoded Patient Relevant History: Patient Other Relevant History 1: None | 4038 | 0.7% | |
| unknown | 3839 | 0.7% | |
| no | 3435 | 0.6% | |
| NONE | 3192 | 0.6% | |
| Unknown | 3110 | 0.5% | |
| Comments: No medical history was provided by the reporter. | 2504 | 0.4% | |
| Medical History/Concurrent Conditions: No adverse event (No reported medical history) | 2187 | 0.4% | |
| Hypertension | 1998 | 0.4% | |
| None. | 1707 | 0.3% | |
| Medical History/Concurrent Conditions: No adverse event (No medical history reported) | 1694 | 0.3% | |
| High blood pressure | 1620 | 0.3% | |
| Medical History/Concurrent Conditions: COVID-19 | 1620 | 0.3% | |
| asthma | 1529 | 0.3% | |
| Medical History/Concurrent Conditions: No adverse event (No reported medical history.) | 1443 | 0.3% | |
| Hypothyroidism | 1377 | 0.2% | |
| Diabetes | 1214 | 0.2% | |
| HTN | 989 | 0.2% | |
| N/a | 967 | 0.2% | |
| Migraines | 926 | 0.2% | |
| Other values (146382) | 222662 | 39.1% | |
| (Missing) | 211582 | 37.2% |
Frequencies of value counts
Unique
| Unique | 137188 ? |
|---|---|
| Unique (%) | 38.3% |
Histogram of lengths of the category
Length
| Max length | 10700 |
|---|---|
| Median length | 4 |
| Mean length | 28.39772538 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1731328 | 10.7% | ||
| n | 1400391 | 8.7% | |
| e | 1366370 | 8.4% | |
| i | 1111170 | 6.9% | |
| o | 1067088 | 6.6% | |
| a | 1015457 | 6.3% | |
| t | 906370 | 5.6% | |
| r | 893539 | 5.5% | |
| s | 841904 | 5.2% | |
| l | 491510 | 3.0% | |
| d | 480301 | 3.0% | |
| c | 401777 | 2.5% | |
| h | 356023 | 2.2% | |
| y | 339801 | 2.1% | |
| m | 291806 | 1.8% | |
| p | 265801 | 1.6% | |
| u | 246916 | 1.5% | |
| C | 203933 | 1.3% | |
| , | 189244 | 1.2% | |
| g | 180190 | 1.1% | |
| H | 158707 | 1.0% | |
| N | 145004 | 0.9% | |
| b | 128778 | 0.8% | |
| v | 117290 | 0.7% | |
| f | 99871 | 0.6% | |
| Other values (73) | 1741964 | 10.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 12178111 | 75.3% | |
| Space Separator | 1731328 | 10.7% | |
| Uppercase Letter | 1440943 | 8.9% | |
| Other Punctuation | 499646 | 3.1% | |
| Decimal Number | 165542 | 1.0% | |
| Close Punctuation | 58923 | 0.4% | |
| Open Punctuation | 58762 | 0.4% | |
| Dash Punctuation | 36650 | 0.2% | |
| Math Symbol | 1310 | < 0.1% | |
| Connector Punctuation | 851 | < 0.1% | |
| Other Symbol | 402 | < 0.1% | |
| Control | 53 | < 0.1% | |
| Modifier Symbol | 10 | < 0.1% | |
| Currency Symbol | 2 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 203933 | 14.2% | |
| H | 158707 | 11.0% | |
| N | 145004 | 10.1% | |
| A | 98351 | 6.8% | |
| D | 94335 | 6.5% | |
| M | 92497 | 6.4% | |
| S | 72023 | 5.0% | |
| P | 71157 | 4.9% | |
| I | 69200 | 4.8% | |
| O | 67667 | 4.7% | |
| E | 61265 | 4.3% | |
| R | 57141 | 4.0% | |
| T | 56776 | 3.9% | |
| L | 35502 | 2.5% | |
| B | 35416 | 2.5% | |
| U | 24154 | 1.7% | |
| V | 23797 | 1.7% | |
| G | 23156 | 1.6% | |
| F | 21112 | 1.5% | |
| Y | 9501 | 0.7% | |
| K | 9411 | 0.7% | |
| W | 4665 | 0.3% | |
| J | 2816 | 0.2% | |
| X | 1927 | 0.1% | |
| Z | 977 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1400391 | 11.5% | |
| e | 1366370 | 11.2% | |
| i | 1111170 | 9.1% | |
| o | 1067088 | 8.8% | |
| a | 1015457 | 8.3% | |
| t | 906370 | 7.4% | |
| r | 893539 | 7.3% | |
| s | 841904 | 6.9% | |
| l | 491510 | 4.0% | |
| d | 480301 | 3.9% | |
| c | 401777 | 3.3% | |
| h | 356023 | 2.9% | |
| y | 339801 | 2.8% | |
| m | 291806 | 2.4% | |
| p | 265801 | 2.2% | |
| u | 246916 | 2.0% | |
| g | 180190 | 1.5% | |
| b | 128778 | 1.1% | |
| v | 117290 | 1.0% | |
| f | 99871 | 0.8% | |
| w | 70874 | 0.6% | |
| k | 55784 | 0.5% | |
| x | 31541 | 0.3% | |
| z | 8977 | 0.1% | |
| j | 6313 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1731328 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 189244 | 37.9% | |
| : | 95827 | 19.2% | |
| / | 74915 | 15.0% | |
| ; | 63099 | 12.6% | |
| . | 55908 | 11.2% | |
| ' | 9240 | 1.8% | |
| ? | 7451 | 1.5% | |
| & | 1376 | 0.3% | |
| " | 1142 | 0.2% | |
| * | 536 | 0.1% | |
| % | 384 | 0.1% | |
| # | 320 | 0.1% | |
| ! | 153 | < 0.1% | |
| @ | 33 | < 0.1% | |
| \ | 18 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 41210 | 24.9% | |
| 2 | 37751 | 22.8% | |
| 0 | 30520 | 18.4% | |
| 9 | 17167 | 10.4% | |
| 3 | 9195 | 5.6% | |
| 5 | 7381 | 4.5% | |
| 4 | 6528 | 3.9% | |
| 8 | 5677 | 3.4% | |
| 6 | 5128 | 3.1% | |
| 7 | 4985 | 3.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 36650 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 58266 | 99.2% | |
| [ | 489 | 0.8% | |
| { | 7 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 58446 | 99.2% | |
| ] | 473 | 0.8% | |
| } | 4 | < 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 577 | 44.0% | |
| = | 294 | 22.4% | |
| > | 183 | 14.0% | |
| ~ | 135 | 10.3% | |
| < | 95 | 7.3% | |
| | | 26 | 2.0% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| � | 402 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 851 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ^ | 9 | 90.0% | |
| ` | 1 | 10.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 2 | 100.0% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 31 | 58.5% | ||
| | 22 | 41.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 13619054 | 84.2% | |
| Common | 2553479 | 15.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1400391 | 10.3% | |
| e | 1366370 | 10.0% | |
| i | 1111170 | 8.2% | |
| o | 1067088 | 7.8% | |
| a | 1015457 | 7.5% | |
| t | 906370 | 6.7% | |
| r | 893539 | 6.6% | |
| s | 841904 | 6.2% | |
| l | 491510 | 3.6% | |
| d | 480301 | 3.5% | |
| c | 401777 | 3.0% | |
| h | 356023 | 2.6% | |
| y | 339801 | 2.5% | |
| m | 291806 | 2.1% | |
| p | 265801 | 2.0% | |
| u | 246916 | 1.8% | |
| C | 203933 | 1.5% | |
| g | 180190 | 1.3% | |
| H | 158707 | 1.2% | |
| N | 145004 | 1.1% | |
| b | 128778 | 0.9% | |
| v | 117290 | 0.9% | |
| f | 99871 | 0.7% | |
| A | 98351 | 0.7% | |
| D | 94335 | 0.7% | |
| Other values (27) | 916371 | 6.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1731328 | 67.8% | ||
| , | 189244 | 7.4% | |
| : | 95827 | 3.8% | |
| / | 74915 | 2.9% | |
| ; | 63099 | 2.5% | |
| ) | 58446 | 2.3% | |
| ( | 58266 | 2.3% | |
| . | 55908 | 2.2% | |
| 1 | 41210 | 1.6% | |
| 2 | 37751 | 1.5% | |
| - | 36650 | 1.4% | |
| 0 | 30520 | 1.2% | |
| 9 | 17167 | 0.7% | |
| ' | 9240 | 0.4% | |
| 3 | 9195 | 0.4% | |
| ? | 7451 | 0.3% | |
| 5 | 7381 | 0.3% | |
| 4 | 6528 | 0.3% | |
| 8 | 5677 | 0.2% | |
| 6 | 5128 | 0.2% | |
| 7 | 4985 | 0.2% | |
| & | 1376 | 0.1% | |
| " | 1142 | < 0.1% | |
| _ | 851 | < 0.1% | |
| + | 577 | < 0.1% | |
| Other values (21) | 3617 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 16172131 | > 99.9% | |
| Specials | 402 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1731328 | 10.7% | ||
| n | 1400391 | 8.7% | |
| e | 1366370 | 8.4% | |
| i | 1111170 | 6.9% | |
| o | 1067088 | 6.6% | |
| a | 1015457 | 6.3% | |
| t | 906370 | 5.6% | |
| r | 893539 | 5.5% | |
| s | 841904 | 5.2% | |
| l | 491510 | 3.0% | |
| d | 480301 | 3.0% | |
| c | 401777 | 2.5% | |
| h | 356023 | 2.2% | |
| y | 339801 | 2.1% | |
| m | 291806 | 1.8% | |
| p | 265801 | 1.6% | |
| u | 246916 | 1.5% | |
| C | 203933 | 1.3% | |
| , | 189244 | 1.2% | |
| g | 180190 | 1.1% | |
| H | 158707 | 1.0% | |
| N | 145004 | 0.9% | |
| b | 128778 | 0.8% | |
| v | 117290 | 0.7% | |
| f | 99871 | 0.6% | |
| Other values (72) | 1741562 | 10.8% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
| � | 402 | 100.0% |
| Distinct | 23877 |
|---|---|
| Distinct (%) | 89.1% |
| Missing | 542711 |
| Missing (%) | 95.3% |
| Memory size | 4.3 MiB |
| Flu shot | 199 |
|---|---|
| Flu | 146 |
| Flu vaccine | 144 |
| Shingles | 133 |
| Shingrix | 89 |
| Other values (23872) |
| Value | Count | Frequency (%) | |
| Flu shot | 199 | < 0.1% | |
| Flu | 146 | < 0.1% | |
| Flu vaccine | 144 | < 0.1% | |
| Shingles | 133 | < 0.1% | |
| Shingrix | 89 | < 0.1% | |
| Tetanus | 82 | < 0.1% | |
| Influenza | 80 | < 0.1% | |
| flu vaccine | 67 | < 0.1% | |
| Moderna | 61 | < 0.1% | |
| flu shot | 61 | < 0.1% | |
| Sore arm | 54 | < 0.1% | |
| Flu Vaccine | 48 | < 0.1% | |
| MMR | 48 | < 0.1% | |
| unknown | 48 | < 0.1% | |
| Shingles vaccine | 42 | < 0.1% | |
| influenza | 41 | < 0.1% | |
| Pneumonia | 38 | < 0.1% | |
| Penicillin | 37 | < 0.1% | |
| Flu Shot | 35 | < 0.1% | |
| fainting | 34 | < 0.1% | |
| Influenza vaccine | 32 | < 0.1% | |
| flu | 32 | < 0.1% | |
| sore arm | 29 | < 0.1% | |
| shingles | 28 | < 0.1% | |
| Tdap | 27 | < 0.1% | |
| Other values (23852) | 25155 | 4.4% | |
| (Missing) | 542711 | 95.3% |
Frequencies of value counts
Unique
| Unique | 23377 ? |
|---|---|
| Unique (%) | 87.3% |
Histogram of lengths of the category
Length
| Max length | 128 |
|---|---|
| Median length | 3 |
| Mean length | 5.428116895 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1165879 | 37.7% | |
| a | 640076 | 20.7% | |
| 228137 | 7.4% | ||
| e | 127389 | 4.1% | |
| i | 82903 | 2.7% | |
| t | 77956 | 2.5% | |
| s | 74393 | 2.4% | |
| o | 70671 | 2.3% | |
| r | 60201 | 1.9% | |
| l | 46545 | 1.5% | |
| c | 46348 | 1.5% | |
| d | 41862 | 1.4% | |
| h | 41855 | 1.4% | |
| f | 29240 | 0.9% | |
| u | 28362 | 0.9% | |
| v | 24312 | 0.8% | |
| m | 23166 | 0.7% | |
| g | 21613 | 0.7% | |
| , | 18292 | 0.6% | |
| y | 17247 | 0.6% | |
| p | 16561 | 0.5% | |
| 2 | 15446 | 0.5% | |
| 1 | 15054 | 0.5% | |
| w | 13507 | 0.4% | |
| 0 | 11795 | 0.4% | |
| Other values (68) | 152508 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 2675998 | 86.6% | |
| Space Separator | 228137 | 7.4% | |
| Uppercase Letter | 73920 | 2.4% | |
| Decimal Number | 62641 | 2.0% | |
| Other Punctuation | 40175 | 1.3% | |
| Dash Punctuation | 6188 | 0.2% | |
| Open Punctuation | 2004 | 0.1% | |
| Close Punctuation | 1873 | 0.1% | |
| Math Symbol | 365 | < 0.1% | |
| Other Symbol | 9 | < 0.1% | |
| Connector Punctuation | 8 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1165879 | 43.6% | |
| a | 640076 | 23.9% | |
| e | 127389 | 4.8% | |
| i | 82903 | 3.1% | |
| t | 77956 | 2.9% | |
| s | 74393 | 2.8% | |
| o | 70671 | 2.6% | |
| r | 60201 | 2.2% | |
| l | 46545 | 1.7% | |
| c | 46348 | 1.7% | |
| d | 41862 | 1.6% | |
| h | 41855 | 1.6% | |
| f | 29240 | 1.1% | |
| u | 28362 | 1.1% | |
| v | 24312 | 0.9% | |
| m | 23166 | 0.9% | |
| g | 21613 | 0.8% | |
| y | 17247 | 0.6% | |
| p | 16561 | 0.6% | |
| w | 13507 | 0.5% | |
| b | 8873 | 0.3% | |
| k | 5463 | 0.2% | |
| x | 4382 | 0.2% | |
| z | 4105 | 0.2% | |
| j | 2847 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 228137 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 18292 | 45.5% | |
| . | 10391 | 25.9% | |
| / | 6802 | 16.9% | |
| ; | 1382 | 3.4% | |
| ' | 708 | 1.8% | |
| : | 688 | 1.7% | |
| # | 558 | 1.4% | |
| " | 543 | 1.4% | |
| ? | 412 | 1.0% | |
| & | 316 | 0.8% | |
| @ | 41 | 0.1% | |
| ! | 21 | 0.1% | |
| \ | 7 | < 0.1% | |
| * | 7 | < 0.1% | |
| % | 7 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 15446 | 24.7% | |
| 1 | 15054 | 24.0% | |
| 0 | 11795 | 18.8% | |
| 9 | 4650 | 7.4% | |
| 3 | 3528 | 5.6% | |
| 5 | 2911 | 4.6% | |
| 4 | 2761 | 4.4% | |
| 6 | 2428 | 3.9% | |
| 8 | 2121 | 3.4% | |
| 7 | 1947 | 3.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| I | 7699 | 10.4% | |
| S | 7449 | 10.1% | |
| F | 5768 | 7.8% | |
| A | 4886 | 6.6% | |
| P | 4798 | 6.5% | |
| M | 4457 | 6.0% | |
| T | 4378 | 5.9% | |
| C | 4270 | 5.8% | |
| V | 4123 | 5.6% | |
| D | 3912 | 5.3% | |
| O | 3571 | 4.8% | |
| E | 3354 | 4.5% | |
| R | 2932 | 4.0% | |
| N | 2702 | 3.7% | |
| H | 2560 | 3.5% | |
| L | 1609 | 2.2% | |
| B | 1308 | 1.8% | |
| U | 914 | 1.2% | |
| G | 870 | 1.2% | |
| W | 626 | 0.8% | |
| Y | 587 | 0.8% | |
| J | 458 | 0.6% | |
| Z | 309 | 0.4% | |
| K | 188 | 0.3% | |
| X | 128 | 0.2% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 6188 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 1996 | 99.6% | |
| [ | 7 | 0.3% | |
| { | 1 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 1864 | 99.5% | |
| ] | 8 | 0.4% | |
| } | 1 | 0.1% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| ~ | 181 | 49.6% | |
| + | 124 | 34.0% | |
| = | 26 | 7.1% | |
| > | 16 | 4.4% | |
| < | 13 | 3.6% | |
| | | 5 | 1.4% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| � | 9 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 8 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 2749918 | 89.0% | |
| Common | 341400 | 11.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1165879 | 42.4% | |
| a | 640076 | 23.3% | |
| e | 127389 | 4.6% | |
| i | 82903 | 3.0% | |
| t | 77956 | 2.8% | |
| s | 74393 | 2.7% | |
| o | 70671 | 2.6% | |
| r | 60201 | 2.2% | |
| l | 46545 | 1.7% | |
| c | 46348 | 1.7% | |
| d | 41862 | 1.5% | |
| h | 41855 | 1.5% | |
| f | 29240 | 1.1% | |
| u | 28362 | 1.0% | |
| v | 24312 | 0.9% | |
| m | 23166 | 0.8% | |
| g | 21613 | 0.8% | |
| y | 17247 | 0.6% | |
| p | 16561 | 0.6% | |
| w | 13507 | 0.5% | |
| b | 8873 | 0.3% | |
| I | 7699 | 0.3% | |
| S | 7449 | 0.3% | |
| F | 5768 | 0.2% | |
| k | 5463 | 0.2% | |
| Other values (27) | 64580 | 2.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 228137 | 66.8% | ||
| , | 18292 | 5.4% | |
| 2 | 15446 | 4.5% | |
| 1 | 15054 | 4.4% | |
| 0 | 11795 | 3.5% | |
| . | 10391 | 3.0% | |
| / | 6802 | 2.0% | |
| - | 6188 | 1.8% | |
| 9 | 4650 | 1.4% | |
| 3 | 3528 | 1.0% | |
| 5 | 2911 | 0.9% | |
| 4 | 2761 | 0.8% | |
| 6 | 2428 | 0.7% | |
| 8 | 2121 | 0.6% | |
| ( | 1996 | 0.6% | |
| 7 | 1947 | 0.6% | |
| ) | 1864 | 0.5% | |
| ; | 1382 | 0.4% | |
| ' | 708 | 0.2% | |
| : | 688 | 0.2% | |
| # | 558 | 0.2% | |
| " | 543 | 0.2% | |
| ? | 412 | 0.1% | |
| & | 316 | 0.1% | |
| ~ | 181 | 0.1% | |
| Other values (16) | 301 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3091309 | > 99.9% | |
| Specials | 9 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1165879 | 37.7% | |
| a | 640076 | 20.7% | |
| 228137 | 7.4% | ||
| e | 127389 | 4.1% | |
| i | 82903 | 2.7% | |
| t | 77956 | 2.5% | |
| s | 74393 | 2.4% | |
| o | 70671 | 2.3% | |
| r | 60201 | 1.9% | |
| l | 46545 | 1.5% | |
| c | 46348 | 1.5% | |
| d | 41862 | 1.4% | |
| h | 41855 | 1.4% | |
| f | 29240 | 0.9% | |
| u | 28362 | 0.9% | |
| v | 24312 | 0.8% | |
| m | 23166 | 0.7% | |
| g | 21613 | 0.7% | |
| , | 18292 | 0.6% | |
| y | 17247 | 0.6% | |
| p | 16561 | 0.5% | |
| 2 | 15446 | 0.5% | |
| 1 | 15054 | 0.5% | |
| w | 13507 | 0.4% | |
| 0 | 11795 | 0.4% | |
| Other values (67) | 152499 | 4.9% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
| � | 9 | 100.0% |
| Distinct | 77868 |
|---|---|
| Distinct (%) | 47.3% |
| Missing | 404977 |
| Missing (%) | 71.1% |
| Memory size | 4.3 MiB |
| USMODERNATX, INC.MOD20210 | |
|---|---|
| vsafe | |
| USMODERNATX, INC.MOD20212 | |
| USMODERNATX, INC.MOD20211 | |
| USMODERNATX, INC.MOD20213 | 1523 |
| Other values (77863) |
| Value | Count | Frequency (%) | |
| USMODERNATX, INC.MOD20210 | 51470 | 9.0% | |
| vsafe | 10605 | 1.9% | |
| USMODERNATX, INC.MOD20212 | 9847 | 1.7% | |
| USMODERNATX, INC.MOD20211 | 9249 | 1.6% | |
| USMODERNATX, INC.MOD20213 | 1523 | 0.3% | |
| USGLAXOSMITHKLINEUS2021AM | 1233 | 0.2% | |
| USMODERNATX, INC.MOD20200 | 807 | 0.1% | |
| USGLAXOSMITHKLINEUS202113 | 167 | < 0.1% | |
| USGLAXOSMITHKLINEUS2020AM | 124 | < 0.1% | |
| USGLAXOSMITHKLINEUS202112 | 117 | < 0.1% | |
| USGLAXOSMITHKLINEUS202115 | 115 | < 0.1% | |
| CA134B1001 | 94 | < 0.1% | |
| USEMERGENT BIOSOLUTIONS20 | 90 | < 0.1% | |
| USGLAXOSMITHKLINEUS202024 | 74 | < 0.1% | |
| USGLAXOSMITHKLINEUS202114 | 72 | < 0.1% | |
| USGLAXOSMITHKLINEUS202023 | 70 | < 0.1% | |
| USBAVARIAN NORDIC A/SUSBN | 67 | < 0.1% | |
| Unknown | 56 | < 0.1% | |
| USGLAXOSMITHKLINEUS2021GS | 56 | < 0.1% | |
| TX29 | 54 | < 0.1% | |
| USGLAXOSMITHKLINEUS202111 | 53 | < 0.1% | |
| USGLAXOSMITHKLINEUS202116 | 49 | < 0.1% | |
| USGLAXOSMITHKLINEUS202110 | 46 | < 0.1% | |
| USGLAXOSMITHKLINEUS202100 | 46 | < 0.1% | |
| unknown | 45 | < 0.1% | |
| Other values (77843) | 78395 | 13.8% | |
| (Missing) | 404977 | 71.1% |
Frequencies of value counts
Unique
| Unique | 77692 ? |
|---|---|
| Unique (%) | 47.2% |
Histogram of lengths of the category
Length
| Max length | 25 |
|---|---|
| Median length | 3 |
| Mean length | 8.465825345 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 810641 | 16.8% | |
| a | 415961 | 8.6% | |
| 2 | 361610 | 7.5% | |
| 0 | 288450 | 6.0% | |
| N | 222834 | 4.6% | |
| 1 | 215729 | 4.5% | |
| I | 199021 | 4.1% | |
| O | 162747 | 3.4% | |
| S | 160417 | 3.3% | |
| U | 156088 | 3.2% | |
| M | 150066 | 3.1% | |
| C | 147373 | 3.1% | |
| D | 146210 | 3.0% | |
| E | 136293 | 2.8% | |
| 134184 | 2.8% | ||
| R | 133996 | 2.8% | |
| A | 80046 | 1.7% | |
| T | 75673 | 1.6% | |
| X | 75418 | 1.6% | |
| F | 74005 | 1.5% | |
| , | 72958 | 1.5% | |
| . | 72951 | 1.5% | |
| P | 60610 | 1.3% | |
| Z | 60177 | 1.2% | |
| 3 | 55084 | 1.1% | |
| Other values (53) | 352754 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 2082317 | 43.2% | |
| Lowercase Letter | 1272589 | 26.4% | |
| Decimal Number | 1185698 | 24.6% | |
| Other Punctuation | 146178 | 3.0% | |
| Space Separator | 134184 | 2.8% | |
| Dash Punctuation | 319 | < 0.1% | |
| Open Punctuation | 5 | < 0.1% | |
| Close Punctuation | 3 | < 0.1% | |
| Connector Punctuation | 2 | < 0.1% | |
| Modifier Symbol | 1 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 810641 | 63.7% | |
| a | 415961 | 32.7% | |
| e | 11116 | 0.9% | |
| s | 10806 | 0.8% | |
| f | 10716 | 0.8% | |
| v | 10687 | 0.8% | |
| o | 505 | < 0.1% | |
| i | 289 | < 0.1% | |
| t | 259 | < 0.1% | |
| r | 252 | < 0.1% | |
| c | 218 | < 0.1% | |
| k | 168 | < 0.1% | |
| w | 164 | < 0.1% | |
| d | 137 | < 0.1% | |
| u | 128 | < 0.1% | |
| l | 113 | < 0.1% | |
| p | 105 | < 0.1% | |
| m | 80 | < 0.1% | |
| h | 79 | < 0.1% | |
| z | 46 | < 0.1% | |
| b | 45 | < 0.1% | |
| j | 28 | < 0.1% | |
| g | 23 | < 0.1% | |
| y | 20 | < 0.1% | |
| x | 2 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 361610 | 30.5% | |
| 0 | 288450 | 24.3% | |
| 1 | 215729 | 18.2% | |
| 3 | 55084 | 4.6% | |
| 4 | 53289 | 4.5% | |
| 5 | 49292 | 4.2% | |
| 6 | 42742 | 3.6% | |
| 7 | 40176 | 3.4% | |
| 8 | 39784 | 3.4% | |
| 9 | 39542 | 3.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 222834 | 10.7% | |
| I | 199021 | 9.6% | |
| O | 162747 | 7.8% | |
| S | 160417 | 7.7% | |
| U | 156088 | 7.5% | |
| M | 150066 | 7.2% | |
| C | 147373 | 7.1% | |
| D | 146210 | 7.0% | |
| E | 136293 | 6.5% | |
| R | 133996 | 6.4% | |
| A | 80046 | 3.8% | |
| T | 75673 | 3.6% | |
| X | 75418 | 3.6% | |
| F | 74005 | 3.6% | |
| P | 60610 | 2.9% | |
| Z | 60177 | 2.9% | |
| J | 27670 | 1.3% | |
| L | 4984 | 0.2% | |
| G | 2607 | 0.1% | |
| H | 2476 | 0.1% | |
| K | 2419 | 0.1% | |
| B | 406 | < 0.1% | |
| V | 379 | < 0.1% | |
| Q | 333 | < 0.1% | |
| W | 44 | < 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 319 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 134184 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 5 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 72958 | 49.9% | |
| . | 72951 | 49.9% | |
| / | 100 | 0.1% | |
| ? | 73 | < 0.1% | |
| # | 64 | < 0.1% | |
| : | 13 | < 0.1% | |
| ' | 12 | < 0.1% | |
| ; | 3 | < 0.1% | |
| & | 3 | < 0.1% | |
| @ | 1 | < 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 3 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 2 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 3354906 | 69.6% | |
| Common | 1466390 | 30.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 810641 | 24.2% | |
| a | 415961 | 12.4% | |
| N | 222834 | 6.6% | |
| I | 199021 | 5.9% | |
| O | 162747 | 4.9% | |
| S | 160417 | 4.8% | |
| U | 156088 | 4.7% | |
| M | 150066 | 4.5% | |
| C | 147373 | 4.4% | |
| D | 146210 | 4.4% | |
| E | 136293 | 4.1% | |
| R | 133996 | 4.0% | |
| A | 80046 | 2.4% | |
| T | 75673 | 2.3% | |
| X | 75418 | 2.2% | |
| F | 74005 | 2.2% | |
| P | 60610 | 1.8% | |
| Z | 60177 | 1.8% | |
| J | 27670 | 0.8% | |
| e | 11116 | 0.3% | |
| s | 10806 | 0.3% | |
| f | 10716 | 0.3% | |
| v | 10687 | 0.3% | |
| L | 4984 | 0.1% | |
| G | 2607 | 0.1% | |
| Other values (27) | 8744 | 0.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 361610 | 24.7% | |
| 0 | 288450 | 19.7% | |
| 1 | 215729 | 14.7% | |
| 134184 | 9.2% | ||
| , | 72958 | 5.0% | |
| . | 72951 | 5.0% | |
| 3 | 55084 | 3.8% | |
| 4 | 53289 | 3.6% | |
| 5 | 49292 | 3.4% | |
| 6 | 42742 | 2.9% | |
| 7 | 40176 | 2.7% | |
| 8 | 39784 | 2.7% | |
| 9 | 39542 | 2.7% | |
| - | 319 | < 0.1% | |
| / | 100 | < 0.1% | |
| ? | 73 | < 0.1% | |
| # | 64 | < 0.1% | |
| : | 13 | < 0.1% | |
| ' | 12 | < 0.1% | |
| ( | 5 | < 0.1% | |
| ) | 3 | < 0.1% | |
| ; | 3 | < 0.1% | |
| & | 3 | < 0.1% | |
| _ | 2 | < 0.1% | |
| @ | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4821296 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 810641 | 16.8% | |
| a | 415961 | 8.6% | |
| 2 | 361610 | 7.5% | |
| 0 | 288450 | 6.0% | |
| N | 222834 | 4.6% | |
| 1 | 215729 | 4.5% | |
| I | 199021 | 4.1% | |
| O | 162747 | 3.4% | |
| S | 160417 | 3.3% | |
| U | 156088 | 3.2% | |
| M | 150066 | 3.1% | |
| C | 147373 | 3.1% | |
| D | 146210 | 3.0% | |
| E | 136293 | 2.8% | |
| 134184 | 2.8% | ||
| R | 133996 | 2.8% | |
| A | 80046 | 1.7% | |
| T | 75673 | 1.6% | |
| X | 75418 | 1.6% | |
| F | 74005 | 1.5% | |
| , | 72958 | 1.5% | |
| . | 72951 | 1.5% | |
| P | 60610 | 1.3% | |
| Z | 60177 | 1.2% | |
| 3 | 55084 | 1.1% | |
| Other values (53) | 352754 | 7.3% |
FORM_VERS
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.3 MiB |
| 2 | |
|---|---|
| 1 | 393 |
| Value | Count | Frequency (%) | |
| 2 | 569108 | 99.9% | |
| 1 | 393 | 0.1% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 569108 | 99.9% | |
| 1 | 393 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 569501 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 569108 | 99.9% | |
| 1 | 393 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 569501 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 569108 | 99.9% | |
| 1 | 393 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 569501 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 569108 | 99.9% | |
| 1 | 393 | 0.1% |
| Distinct | 367 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2650 |
| Missing (%) | 0.5% |
| Memory size | 4.3 MiB |
| 08/12/2021 | 14154 |
|---|---|
| 08/11/2021 | 12061 |
| 08/13/2021 | 8260 |
| 08/17/2021 | 8180 |
| 08/10/2021 | 7996 |
| Other values (362) |
| Value | Count | Frequency (%) | |
| 08/12/2021 | 14154 | 2.5% | |
| 08/11/2021 | 12061 | 2.1% | |
| 08/13/2021 | 8260 | 1.5% | |
| 08/17/2021 | 8180 | 1.4% | |
| 08/10/2021 | 7996 | 1.4% | |
| 04/13/2021 | 5821 | 1.0% | |
| 08/16/2021 | 5519 | 1.0% | |
| 08/14/2021 | 4945 | 0.9% | |
| 04/14/2021 | 4833 | 0.8% | |
| 08/09/2021 | 4766 | 0.8% | |
| 04/15/2021 | 4550 | 0.8% | |
| 04/12/2021 | 4358 | 0.8% | |
| 04/09/2021 | 4309 | 0.8% | |
| 01/06/2021 | 4273 | 0.8% | |
| 04/08/2021 | 4200 | 0.7% | |
| 04/16/2021 | 4138 | 0.7% | |
| 04/21/2021 | 3872 | 0.7% | |
| 04/07/2021 | 3834 | 0.7% | |
| 04/22/2021 | 3829 | 0.7% | |
| 04/20/2021 | 3799 | 0.7% | |
| 04/19/2021 | 3593 | 0.6% | |
| 04/23/2021 | 3541 | 0.6% | |
| 04/01/2021 | 3492 | 0.6% | |
| 01/27/2021 | 3487 | 0.6% | |
| 04/27/2021 | 3477 | 0.6% | |
| Other values (342) | 431564 | 75.8% |
Frequencies of value counts
Unique
| Unique | 59 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.967427625 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 1414465 | 24.9% | |
| 0 | 1343122 | 23.7% | |
| / | 1133702 | 20.0% | |
| 1 | 930683 | 16.4% | |
| 3 | 157893 | 2.8% | |
| 4 | 155029 | 2.7% | |
| 8 | 152434 | 2.7% | |
| 5 | 113481 | 2.0% | |
| 6 | 102127 | 1.8% | |
| 7 | 89881 | 1.6% | |
| 9 | 75693 | 1.3% | |
| n | 5300 | 0.1% | |
| a | 2650 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4534808 | 79.9% | |
| Other Punctuation | 1133702 | 20.0% | |
| Lowercase Letter | 7950 | 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 1414465 | 31.2% | |
| 0 | 1343122 | 29.6% | |
| 1 | 930683 | 20.5% | |
| 3 | 157893 | 3.5% | |
| 4 | 155029 | 3.4% | |
| 8 | 152434 | 3.4% | |
| 5 | 113481 | 2.5% | |
| 6 | 102127 | 2.3% | |
| 7 | 89881 | 2.0% | |
| 9 | 75693 | 1.7% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1133702 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 5300 | 66.7% | |
| a | 2650 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 5668510 | 99.9% | |
| Latin | 7950 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 1414465 | 25.0% | |
| 0 | 1343122 | 23.7% | |
| / | 1133702 | 20.0% | |
| 1 | 930683 | 16.4% | |
| 3 | 157893 | 2.8% | |
| 4 | 155029 | 2.7% | |
| 8 | 152434 | 2.7% | |
| 5 | 113481 | 2.0% | |
| 6 | 102127 | 1.8% | |
| 7 | 89881 | 1.6% | |
| 9 | 75693 | 1.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 5300 | 66.7% | |
| a | 2650 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5676460 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 1414465 | 24.9% | |
| 0 | 1343122 | 23.7% | |
| / | 1133702 | 20.0% | |
| 1 | 930683 | 16.4% | |
| 3 | 157893 | 2.8% | |
| 4 | 155029 | 2.7% | |
| 8 | 152434 | 2.7% | |
| 5 | 113481 | 2.0% | |
| 6 | 102127 | 1.8% | |
| 7 | 89881 | 1.6% | |
| 9 | 75693 | 1.3% | |
| n | 5300 | 0.1% | |
| a | 2650 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 569178 |
| Missing (%) | 99.9% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 323 | 0.1% | |
| (Missing) | 569178 | 99.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.998865674 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1138356 | 66.7% | |
| a | 569178 | 33.3% | |
| Y | 323 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1707534 | > 99.9% | |
| Uppercase Letter | 323 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1138356 | 66.7% | |
| a | 569178 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 323 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1707857 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1138356 | 66.7% | |
| a | 569178 | 33.3% | |
| Y | 323 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1707857 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1138356 | 66.7% | |
| a | 569178 | 33.3% | |
| Y | 323 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 462554 |
| Missing (%) | 81.2% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 106947 | 18.8% | |
| (Missing) | 462554 | 81.2% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.62441857 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 925108 | 61.9% | |
| a | 462554 | 30.9% | |
| Y | 106947 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1387662 | 92.8% | |
| Uppercase Letter | 106947 | 7.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 106947 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 925108 | 66.7% | |
| a | 462554 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1494609 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 925108 | 61.9% | |
| a | 462554 | 30.9% | |
| Y | 106947 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1494609 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 925108 | 61.9% | |
| a | 462554 | 30.9% | |
| Y | 106947 | 7.2% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 499046 |
| Missing (%) | 87.6% |
| Memory size | 4.3 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 70455 | 12.4% | |
| (Missing) | 499046 | 87.6% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.752572866 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 998092 | 63.7% | |
| a | 499046 | 31.8% | |
| Y | 70455 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1497138 | 95.5% | |
| Uppercase Letter | 70455 | 4.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 998092 | 66.7% | |
| a | 499046 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 70455 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1567593 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 998092 | 63.7% | |
| a | 499046 | 31.8% | |
| Y | 70455 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1567593 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 998092 | 63.7% | |
| a | 499046 | 31.8% | |
| Y | 70455 | 4.5% |
| Distinct | 99925 |
|---|---|
| Distinct (%) | 33.7% |
| Missing | 273103 |
| Missing (%) | 48.0% |
| Memory size | 4.3 MiB |
| None | |
|---|---|
| none | |
| NKDA | 7445 |
| NKA | 7351 |
| No | 7169 |
| Other values (99920) |
| Value | Count | Frequency (%) | |
| None | 53465 | 9.4% | |
| none | 25465 | 4.5% | |
| NKDA | 7445 | 1.3% | |
| NKA | 7351 | 1.3% | |
| No | 7169 | 1.3% | |
| Penicillin | 7026 | 1.2% | |
| no | 4035 | 0.7% | |
| Sulfa | 3468 | 0.6% | |
| NONE | 3301 | 0.6% | |
| None known | 2720 | 0.5% | |
| unknown | 2493 | 0.4% | |
| Unknown | 2293 | 0.4% | |
| penicillin | 2036 | 0.4% | |
| No known allergies | 1963 | 0.3% | |
| none known | 1627 | 0.3% | |
| None. | 1590 | 0.3% | |
| Amoxicillin | 1368 | 0.2% | |
| nka | 1350 | 0.2% | |
| Codeine | 1255 | 0.2% | |
| N/a | 1133 | 0.2% | |
| Latex | 1067 | 0.2% | |
| PCN | 1062 | 0.2% | |
| sulfa | 989 | 0.2% | |
| no known allergies | 983 | 0.2% | |
| Sulfa drugs | 978 | 0.2% | |
| Other values (99900) | 152766 | 26.8% | |
| (Missing) | 273103 | 48.0% |
Frequencies of value counts
Unique
| Unique | 93440 ? |
|---|---|
| Unique (%) | 31.5% |
Histogram of lengths of the category
Length
| Max length | 10220 |
|---|---|
| Median length | 3 |
| Mean length | 11.90314152 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1069760 | 15.8% | |
| a | 617240 | 9.1% | |
| 588482 | 8.7% | ||
| e | 551395 | 8.1% | |
| i | 443212 | 6.5% | |
| o | 407791 | 6.0% | |
| l | 335429 | 4.9% | |
| s | 243983 | 3.6% | |
| t | 241872 | 3.6% | |
| r | 234944 | 3.5% | |
| c | 186344 | 2.7% | |
| , | 154874 | 2.3% | |
| d | 142096 | 2.1% | |
| N | 128640 | 1.9% | |
| u | 113249 | 1.7% | |
| p | 103165 | 1.5% | |
| h | 103109 | 1.5% | |
| m | 102929 | 1.5% | |
| g | 81352 | 1.2% | |
| f | 76083 | 1.1% | |
| y | 75751 | 1.1% | |
| A | 58612 | 0.9% | |
| S | 48349 | 0.7% | |
| x | 45134 | 0.7% | |
| P | 43895 | 0.6% | |
| Other values (73) | 581161 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 5340797 | 78.8% | |
| Space Separator | 588482 | 8.7% | |
| Uppercase Letter | 574870 | 8.5% | |
| Other Punctuation | 208538 | 3.1% | |
| Decimal Number | 19159 | 0.3% | |
| Dash Punctuation | 16824 | 0.2% | |
| Open Punctuation | 14713 | 0.2% | |
| Close Punctuation | 14614 | 0.2% | |
| Math Symbol | 714 | < 0.1% | |
| Other Symbol | 102 | < 0.1% | |
| Connector Punctuation | 20 | < 0.1% | |
| Control | 11 | < 0.1% | |
| Modifier Symbol | 6 | < 0.1% | |
| Currency Symbol | 1 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 128640 | 22.4% | |
| A | 58612 | 10.2% | |
| S | 48349 | 8.4% | |
| P | 43895 | 7.6% | |
| C | 36300 | 6.3% | |
| I | 28316 | 4.9% | |
| D | 26641 | 4.6% | |
| E | 24184 | 4.2% | |
| L | 23500 | 4.1% | |
| K | 22227 | 3.9% | |
| O | 19166 | 3.3% | |
| T | 17431 | 3.0% | |
| M | 17372 | 3.0% | |
| B | 13615 | 2.4% | |
| R | 13331 | 2.3% | |
| H | 9765 | 1.7% | |
| F | 8023 | 1.4% | |
| U | 7929 | 1.4% | |
| G | 7824 | 1.4% | |
| V | 7367 | 1.3% | |
| W | 3784 | 0.7% | |
| Z | 3084 | 0.5% | |
| Y | 2681 | 0.5% | |
| X | 1700 | 0.3% | |
| Q | 596 | 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1069760 | 20.0% | |
| a | 617240 | 11.6% | |
| e | 551395 | 10.3% | |
| i | 443212 | 8.3% | |
| o | 407791 | 7.6% | |
| l | 335429 | 6.3% | |
| s | 243983 | 4.6% | |
| t | 241872 | 4.5% | |
| r | 234944 | 4.4% | |
| c | 186344 | 3.5% | |
| d | 142096 | 2.7% | |
| u | 113249 | 2.1% | |
| p | 103165 | 1.9% | |
| h | 103109 | 1.9% | |
| m | 102929 | 1.9% | |
| g | 81352 | 1.5% | |
| f | 76083 | 1.4% | |
| y | 75751 | 1.4% | |
| x | 45134 | 0.8% | |
| v | 39523 | 0.7% | |
| w | 37120 | 0.7% | |
| b | 36914 | 0.7% | |
| k | 33769 | 0.6% | |
| z | 12953 | 0.2% | |
| q | 4240 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 588482 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 154874 | 74.3% | |
| . | 23092 | 11.1% | |
| / | 9964 | 4.8% | |
| ; | 9351 | 4.5% | |
| : | 3817 | 1.8% | |
| ? | 2700 | 1.3% | |
| ' | 1510 | 0.7% | |
| & | 1478 | 0.7% | |
| " | 1096 | 0.5% | |
| * | 296 | 0.1% | |
| # | 172 | 0.1% | |
| ! | 96 | < 0.1% | |
| % | 56 | < 0.1% | |
| \ | 24 | < 0.1% | |
| @ | 12 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 13248 | 90.0% | |
| [ | 1456 | 9.9% | |
| { | 9 | 0.1% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 13190 | 90.3% | |
| ] | 1416 | 9.7% | |
| } | 8 | 0.1% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 16824 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 4393 | 22.9% | |
| 1 | 4283 | 22.4% | |
| 2 | 4118 | 21.5% | |
| 3 | 1159 | 6.0% | |
| 5 | 1159 | 6.0% | |
| 9 | 1078 | 5.6% | |
| 4 | 893 | 4.7% | |
| 6 | 773 | 4.0% | |
| 8 | 681 | 3.6% | |
| 7 | 622 | 3.2% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| = | 351 | 49.2% | |
| > | 165 | 23.1% | |
| + | 163 | 22.8% | |
| ~ | 22 | 3.1% | |
| | | 9 | 1.3% | |
| < | 4 | 0.6% |
Most frequent Other Symbol characters
| Value | Count | Frequency (%) | |
| � | 102 | 100.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 20 | 100.0% |
Most frequent Modifier Symbol characters
| Value | Count | Frequency (%) | |
| ` | 5 | 83.3% | |
| ^ | 1 | 16.7% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 1 | 100.0% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 10 | 90.9% | ||
| 1 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5915667 | 87.3% | |
| Common | 863184 | 12.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1069760 | 18.1% | |
| a | 617240 | 10.4% | |
| e | 551395 | 9.3% | |
| i | 443212 | 7.5% | |
| o | 407791 | 6.9% | |
| l | 335429 | 5.7% | |
| s | 243983 | 4.1% | |
| t | 241872 | 4.1% | |
| r | 234944 | 4.0% | |
| c | 186344 | 3.2% | |
| d | 142096 | 2.4% | |
| N | 128640 | 2.2% | |
| u | 113249 | 1.9% | |
| p | 103165 | 1.7% | |
| h | 103109 | 1.7% | |
| m | 102929 | 1.7% | |
| g | 81352 | 1.4% | |
| f | 76083 | 1.3% | |
| y | 75751 | 1.3% | |
| A | 58612 | 1.0% | |
| S | 48349 | 0.8% | |
| x | 45134 | 0.8% | |
| P | 43895 | 0.7% | |
| v | 39523 | 0.7% | |
| w | 37120 | 0.6% | |
| Other values (27) | 384690 | 6.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 588482 | 68.2% | ||
| , | 154874 | 17.9% | |
| . | 23092 | 2.7% | |
| - | 16824 | 1.9% | |
| ( | 13248 | 1.5% | |
| ) | 13190 | 1.5% | |
| / | 9964 | 1.2% | |
| ; | 9351 | 1.1% | |
| 0 | 4393 | 0.5% | |
| 1 | 4283 | 0.5% | |
| 2 | 4118 | 0.5% | |
| : | 3817 | 0.4% | |
| ? | 2700 | 0.3% | |
| ' | 1510 | 0.2% | |
| & | 1478 | 0.2% | |
| [ | 1456 | 0.2% | |
| ] | 1416 | 0.2% | |
| 3 | 1159 | 0.1% | |
| 5 | 1159 | 0.1% | |
| " | 1096 | 0.1% | |
| 9 | 1078 | 0.1% | |
| 4 | 893 | 0.1% | |
| 6 | 773 | 0.1% | |
| 8 | 681 | 0.1% | |
| 7 | 622 | 0.1% | |
| Other values (21) | 1527 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6778749 | > 99.9% | |
| Specials | 102 | < 0.1% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1069760 | 15.8% | |
| a | 617240 | 9.1% | |
| 588482 | 8.7% | ||
| e | 551395 | 8.1% | |
| i | 443212 | 6.5% | |
| o | 407791 | 6.0% | |
| l | 335429 | 4.9% | |
| s | 243983 | 3.6% | |
| t | 241872 | 3.6% | |
| r | 234944 | 3.5% | |
| c | 186344 | 2.7% | |
| , | 154874 | 2.3% | |
| d | 142096 | 2.1% | |
| N | 128640 | 1.9% | |
| u | 113249 | 1.7% | |
| p | 103165 | 1.5% | |
| h | 103109 | 1.5% | |
| m | 102929 | 1.5% | |
| g | 81352 | 1.2% | |
| f | 76083 | 1.1% | |
| y | 75751 | 1.1% | |
| A | 58612 | 0.9% | |
| S | 48349 | 0.7% | |
| x | 45134 | 0.7% | |
| P | 43895 | 0.6% | |
| Other values (72) | 581059 | 8.6% |
Most frequent Specials characters
| Value | Count | Frequency (%) | |
| � | 102 | 100.0% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| VAERS_ID | RECVDATE | STATE | AGE_YRS | CAGE_YR | CAGE_MO | SEX | RPT_DATE | SYMPTOM_TEXT | DIED | DATEDIED | L_THREAT | ER_VISIT | HOSPITAL | HOSPDAYS | X_STAY | DISABLE | RECOVD | VAX_DATE | ONSET_DATE | NUMDAYS | LAB_DATA | V_ADMINBY | V_FUNDBY | OTHER_MEDS | CUR_ILL | HISTORY | PRIOR_VAX | SPLTTYPE | FORM_VERS | TODAYS_DATE | BIRTH_DEFECT | OFC_VISIT | ER_ED_VISIT | ALLERGIES | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 916600 | 01/01/2021 | TX | 33.0 | 33.0 | NaN | F | NaN | Right side of epiglottis swelled up and hinder swallowing pictures taken Benadryl Tylenol taken | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 12/28/2020 | 12/30/2020 | 2.0 | None | PVT | NaN | None | None | None | NaN | NaN | 2 | 01/01/2021 | NaN | Y | NaN | Pcn and bee venom |
| 1 | 916601 | 01/01/2021 | CA | 73.0 | 73.0 | NaN | F | NaN | Approximately 30 min post vaccination administration patient demonstrated SOB and anxiousness. Assessed at time of event: Heart sounds normal, Lung sounds clear. Vitals within normal limits for patient. O2 91% on 3 liters NC Continuous flow. 2 consecutive nebulized albuterol treatments were administered. At approximately 1.5 hours post reaction, patients' SOB and anxiousness had subsided and the patient stated that they were feel "much better". | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 12/31/2020 | 12/31/2020 | 0.0 | NaN | SEN | NaN | Patient residing at nursing facility. See patients chart. | Patient residing at nursing facility. See patients chart. | Patient residing at nursing facility. See patients chart. | NaN | NaN | 2 | 01/01/2021 | NaN | Y | NaN | "Dairy" |
| 2 | 916602 | 01/01/2021 | WA | 23.0 | 23.0 | NaN | F | NaN | About 15 minutes after receiving the vaccine, the patient complained about her left arm hurting. She also complained of chest tightness and difficulty swallowing. Patient also had vision changes. We gave the patient 1 tablet of Benadryl 25 mg and called EMS services. EMS checked her out and we advised the patient to go to the ER to be observed and given more Benadryl. Patient was able to walk out of facility herself. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | U | 12/31/2020 | 12/31/2020 | 0.0 | NaN | SEN | NaN | None | None | None | NaN | NaN | 2 | 01/01/2021 | NaN | NaN | Y | Shellfish |
| 3 | 916603 | 01/01/2021 | WA | 58.0 | 58.0 | NaN | F | NaN | extreme fatigue, dizziness,. could not lift my left arm for 72 hours | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 12/23/2020 | 12/23/2020 | 0.0 | none | WRK | NaN | none | kidney infection | diverticulitis, mitral valve prolapse, osteoarthritis | got measles from measel shot, mums from mumps shot, headaches and nausea from flu shot | NaN | 2 | 01/01/2021 | NaN | NaN | NaN | Diclofenac, novacaine, lidocaine, pickles, tomatoes, milk |
| 4 | 916604 | 01/01/2021 | TX | 47.0 | 47.0 | NaN | F | NaN | Injection site swelling, redness, warm to the touch and itchy | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N | 12/22/2020 | 12/29/2020 | 7.0 | NaN | PUB | NaN | Na | Na | NaN | NaN | NaN | 2 | 01/01/2021 | NaN | NaN | NaN | Na |
| 5 | 916605 | 01/01/2021 | TX | 40.0 | 40.0 | NaN | M | NaN | Adverse Events: Inflammation in the eye, confusion, headaches, inflammation in ears, cold chills, shivering, and fever like symptoms Treatments: Primary care physician ran a series of bloodwork and found that after Flu shot I had big drop in white blood cell count and referred me to ophthalmologist and otolaryngologist ophthalmologist prescribed Cequa to treat the inflammation in eyes along with fortified caster oil. otolaryngologist prescribed Prednisone to treat the inflammtion Time course: Still having adverse events | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N | 09/25/2020 | 09/26/2020 | 1.0 | 11/10/2020 Low white blood cell count | UNK | NaN | Kirkland Multivitamin, Kirkland Calcium vitamin, Vitamin D3, Fish Oil | NaN | NaN | NaN | NaN | 2 | 01/01/2021 | NaN | Y | NaN | NaN |
| 6 | 916606 | 01/01/2021 | NV | 44.0 | 44.0 | NaN | F | NaN | patient called back the next day and stated her throat was swelling and had to take Benadryl. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 12/29/2020 | 12/29/2020 | 0.0 | Did not seek medical care. Treated self at home with Benadryl | PVT | NaN | NaN | NaN | NaN | NaN | NaN | 2 | 01/01/2021 | NaN | NaN | NaN | iodine (shellfish) has epipen |
| 7 | 916607 | 01/01/2021 | KS | 50.0 | 50.0 | NaN | M | NaN | SEVERE chills approximately 13-14 hours after receiving vaccine. Even after turning heat up in the house and wrapping myself in two comforters, I was still experiencing severe chills. These chills lasted for approximately 5-6 hours. I was unable to sleep due to them. I did not have a fever, as I checked my temperature several times during this episode. At approximately 6:00 am on the same day as experiencing the chills, I experienced abdominal pains, which lasted approximately 1 hour and resolved on their own. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 12/28/2020 | 12/29/2020 | 1.0 | None | PUB | NaN | Amlodipine, Ambien, Benicar/HCTZ, Invokana, Metformin, Levothyroxine, Bydureon, Metoprolol | None | High blood pressure, high cholesterol, sleep apnea, insomnia, diabetes type II, obesity. | NaN | NaN | 2 | 01/01/2021 | NaN | NaN | NaN | Penicillin |
| 8 | 916608 | 01/01/2021 | OH | 33.0 | 33.0 | NaN | M | NaN | Nasal congestion and diarrhea | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 12/29/2020 | 12/31/2020 | 2.0 | NaN | OTH | NaN | None | None | None | NaN | NaN | 2 | 01/01/2021 | NaN | NaN | NaN | None |
| 9 | 916609 | 01/01/2021 | TN | 71.0 | 71.0 | NaN | F | NaN | On day 9 following the vaccination I noticed a red raised itchy patch at the vaccination site approximately 2 in X 2 in. No other symptoms. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N | 12/23/2020 | 12/31/2020 | 8.0 | None | PUB | NaN | Medication Summary 1/1/21 Name of Medication RX or OTC Doseage Frequency Reason Comment Meloxicam RX 15 mg 1 qd inflammation Synthroid RX 75 mcg. 1 qd, middle of night Thyroid hormone, T4 Liothyronine SOD RX 10 mcg 1 qd, | none | Hashimoto's thyroiditis, Hypertension, depression | NaN | NaN | 2 | 01/01/2021 | NaN | NaN | NaN | Sulfa antibiotics, azithromycin, adhesive in band-aids or tape |
Last rows
| VAERS_ID | RECVDATE | STATE | AGE_YRS | CAGE_YR | CAGE_MO | SEX | RPT_DATE | SYMPTOM_TEXT | DIED | DATEDIED | L_THREAT | ER_VISIT | HOSPITAL | HOSPDAYS | X_STAY | DISABLE | RECOVD | VAX_DATE | ONSET_DATE | NUMDAYS | LAB_DATA | V_ADMINBY | V_FUNDBY | OTHER_MEDS | CUR_ILL | HISTORY | PRIOR_VAX | SPLTTYPE | FORM_VERS | TODAYS_DATE | BIRTH_DEFECT | OFC_VISIT | ER_ED_VISIT | ALLERGIES | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 569491 | 1708043 | 09/17/2021 | GA | 25.0 | 25.0 | NaN | F | NaN | Vaccine had expired on 08/24/2021 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 08/26/2021 | 08/26/2021 | 0.0 | none | PUB | NaN | UNKNOWN. | UNKNOWN. | UNKNOWN | NaN | NaN | 2 | 09/17/2021 | NaN | NaN | NaN | NKA. |
| 569492 | 1708044 | 09/17/2021 | NC | 30.0 | 30.0 | NaN | F | NaN | Description of events from Patient Message sent through our patient portal: "On August 15th I got the Pfizer vaccine. I have been on birth control for probably 4-5 years now and I?ve always had a little nausea if I forget it and then double up to make up for it but never anything weird or out of the ordinary has happened the entire time I?ve been on birth control. I missed two days of birth control, Friday and Saturday, I doubled up on Sunday (22nd) and Monday (23rd). I received my allergy immunology shot on Monday 23rd (I?m building up still, so I get them every Monday). My tongue swelled on Monday 23rd and got incredibly sore to the point where it was painful to speak. Internet said it could be hormonal.. so I called the pharmacy where I got my birth control ask if that can be a hormonal reaction involving birth control, they said no. I contacted my allergist since I received that the same day, before she was in the allergy department she was an OBGYN, and said that it?s more likely to be hormone related since, I had doubled up. Since the swelling started toward the evening, and not following immediately after the shot, she said it?s very likely to be hormone related. I have never had anything weird like that happen, and it was followed by an entire week of miserable nausea. Out of my female family and female friends who received a covid vaccine, most have had hormonal/heavy/clotted menstrual reactions to the covid vaccines. Since hormones effect menstrual cycles, and my tongue swelling was followed by an entire week of incredible nausea, I figured it might be helpful to report." | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 08/15/2021 | 08/23/2021 | 8.0 | NaN | PVT | NaN | Yaz , Zyrtec, Fluticasone | NaN | GAD | NaN | NaN | 2 | 09/17/2021 | NaN | NaN | NaN | Percocet |
| 569493 | 1708045 | 09/17/2021 | WI | 46.0 | 46.0 | NaN | F | NaN | Random period in the middle of birth control cycle But your system doesn?t even know it is JOHNSON & JOHNSON - NOT JANSSEN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | U | 09/02/2021 | 09/15/2021 | 13.0 | NaN | PHM | NaN | Estrylla (BCP) Methimazole ((2.5 mg 4x/week) | NaN | Graves Disease | NaN | NaN | 2 | 09/17/2021 | NaN | NaN | NaN | Peas, fish |
| 569494 | 1708046 | 09/17/2021 | OR | 38.0 | 38.0 | NaN | F | NaN | Patient reports large, wide-spread, pruritic hives about 10 minutes after the injection. She had no other symptoms. She self-treated with and old prescription for prednisone intermittently for about 12 days. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 09/03/2021 | 09/03/2021 | 0.0 | None. | PVT | NaN | Dicyclomine; Cyclobenzaprine; Lisinopril; Albuterol; Ibuprofen. | Recent COVID infection. | Hypertension; Thrombocytosis; Mild intermittent asthma; Depression; Dyslipidemia; Migraine with aura; Chronic hives. | NaN | NaN | 2 | 09/17/2021 | NaN | Y | NaN | Medroxyprogesterone; Adhesive tape; Aripiprazole; Estradiol-testosterone; Latex; Lithium; Oxycodone; Transparent dressings; Quetiapine; Ziprasidone. |
| 569495 | 1708047 | 09/17/2021 | GA | 56.0 | 56.0 | NaN | M | NaN | Basically: It seems relatively likely there is some neuropathy that may be piriformis/IT related but I cant assess it fully w the PE burden, weight, and his knee safety (and even if I could be cant do the stretches right now) but I would guess its neuropathy w some piriformis tightness worsening it. Then large burden PEs, many, days later, negative covid x2, up to date cancer screenings, no fhx PE/lupus/hypercoag/cancer. 56yo morbid obesity and R knee fx, presented here w SOB and found to have multiple large Pes. Story as follows: recieved second dose moderna 9/1, 9/2 he had tingling in his R knee w some swelling and joint pains more diffuse. He has baselie chronic R knee pain after accident at six flags w acl tear, meniscus tear, and fx after the boat ride accident in 2016 and he uses a L handed cane since. This progressed and by 9/3 there was R hip pain, and tingling, by 9/5 tingling in both LE b/l and weakness, the toes felt numb and the R hip pain was worse. He went to an ED and was palced on muscle relaxant steroid injection and po steroid, no benefit, came to our ed and had ct a few days after negative again steroid injection given. 2 days ago sob worsened and he had xray at medstop, sent home. Presented here elevated DD and significant PE burden. He has some LE swelling at baseline but this is more than usual and he is easy out of breath more than usual. Normal colonscopy 6 months ago anf no fhx of cancers or blood clots. | NaN | NaN | Y | NaN | Y | NaN | NaN | Y | N | 09/01/2021 | 09/02/2021 | 1.0 | CTA w large burden b/l PEs | PVT | NaN | Ibuprofen | NaN | Morbid obesity (52) Chronic R knee traumatic injury (2006) New onset DM2 (a1c 7.7, dx in hospital) | NaN | NaN | 2 | 09/17/2021 | NaN | Y | Y | None |
| 569496 | 1708048 | 09/17/2021 | ME | 33.0 | 33.0 | NaN | F | NaN | Night of shot: Extreme fatigue, muscle aches left side of chest, back, and neck Day 2: Extreme fatigue, muscle aches, joint ache, headache, felt similar to a flu/hangover Day 3&4: Extreme fatigue, muscle aches, joint ache, headache, felt similar to a flu/hangover, ear ache, lymphnodepathy front neck lympthnodes. Hurt to put on deodorant left arm Day 5:Extreme fatigue, muscle aches, joint ache, severe headache, felt similar to a flu/hangover Day 6 &7: fatigue, headache, inner ear pain, increased mucosa production Day 8, 9, 10: increased mucosa production, sore throat resulting in cough and painful to ingest fluids or solids. Failed all exams during this period due to impaired cognitive abilities. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N | 09/07/2021 | 09/07/2021 | 0.0 | Not everyone has health insurance or can afford health insurance. | PHM | NaN | NaN | NaN | NaN | NaN | NaN | 2 | 09/17/2021 | NaN | NaN | NaN | Erythromycin, sulfa, penicillin |
| 569497 | 1708049 | 09/17/2021 | IL | 49.0 | 49.0 | NaN | F | NaN | I started to break out in a rash about 2 hours after the shot. By morning, it covered a large portion of behind my arm, back of legs, and knees. I also had a racing pulse of over 110, severe headache, leg and foot cramping, loss of appetite, fever, a foggy feeling where I could not speak properly or keep my thoughts in line. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N | 09/10/2021 | 09/10/2021 | 0.0 | There was no infection, no virus. Blood work came back normal. Only showed signs of dehydration. | PHM | NaN | Glipizide, Metformin, Losartan, Atorvastatin, Sertraline, Gabapentin. | None. | Diabetic; Partial Thyroid Removal due to nodule. | NaN | NaN | 2 | 09/17/2021 | NaN | Y | Y | Demerol; Pineapple; Seafood/Shellfish. |
| 569498 | 1708050 | 09/17/2021 | NY | NaN | NaN | NaN | U | NaN | Cold sweats mostly at night sometimes during the day, body aches, joint pain, burning sensation in stomach mostly at night. Symptoms generally come in waves and are at times more tolerable than others. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | U | NaN | NaN | NaN | NaN | PHM | NaN | NaN | NaN | NaN | NaN | NaN | 2 | 09/17/2021 | NaN | NaN | NaN | NaN |
| 569499 | 1708052 | 09/17/2021 | ME | 42.0 | 42.0 | NaN | F | NaN | My husband had the shot and 2 days after I started my period again after it had just ended 3 days before his shot and it lasted for a full month. I also had night sweats, chills, body aches, tired all the time, as well as insomnia. This lasted for a full month. I tried to report to my doctor, but they were in denial that this could be vaccine shedding. A co-workers husband who also works at the SY and got the the J&J shot suffered the same affects as I did. | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Y | 06/10/2021 | 06/12/2021 | 2.0 | None. Dr's office was in denial after I reported issues related to vaccine shedding. | MIL | NaN | flonase, claritin. | none | adenomyosis, chronic sinusitis, arthritis, migraines | NaN | NaN | 2 | 09/17/2021 | NaN | NaN | NaN | NaN |
| 569500 | 1708053 | 09/17/2021 | FL | 68.0 | 68.0 | NaN | M | NaN | My right thumb now "SHAKES" when I use it and my index finger together | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | N | 02/07/2021 | 03/18/2021 | 39.0 | Doctor advise me to report it. | OTH | NaN | None | Afib | Afib | NaN | NaN | 2 | 09/17/2021 | NaN | NaN | NaN | None |